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large cross-sectional dimensions required. At infrared or optical frequencies, 
they again become impractical since the cross-sectional dimensions become 
too small. As a result, the microwave portion of the spectrum is the only 
region in which ordinary waveguides can be used in practice. 

In ordinary waveguides, there are no separate conductors between which 
the voltage can be defined and the concept of power expressed by the product 
of voltage and current is no longer directly applicable. Consequently, the 
electromagnetic field itself has to be investigated first rather than the voltage 
and current which, in a sense, represent the field. In general, it is hopelessly 
difficult to solve Maxwell's equations under appropriate boundary conditions 
and to analyze the electromagnetic field of complex waveguide systems en- 
countered in practice or in laboratories. Even if a formal solution is easily 
obtainable, the interpretation of the result may become so complicated that 
no usefull information can be extracted from it. In conventional circuit 
theory, the resistance, capacitance, and inductance are defined without 
specifying their physical structures or materials used. The relations between 
the voltage and current at the terminals of such elements are first clarified. 
Then, the properties of a complex network are studied as a combination 
of the effects of each element making up the network. There is no need to 
solve Maxwell’s equations explicitly and yet much useful information is 
obtainable. In a similar fashion, it is possible to study each discontinuity or 
junction of waveguides separately and to clarify its effect on the propagating 
modes of the electromagnetic fields either by experiments or by solving 
Maxwell’s equations. The behavior of a complex waveguide system can then 
be discussed as a combination of the effects of such elements. By this ap- 
proach, a better understanding can be obtained with less difficulty than 
trying to solve Maxwell’s equations directly for the whole system. The 
theory of microwave circuits is the study of this approach. The word circuit 
comes from the similarity existing between this approach and that of con- 
ventional circuit theory. The concept of voltage and current can be esta- 
blished by observing the fact that the transverse electric and magnetic fields 
in a waveguide vary with distance in the same manner as the voltage and 
current do along a conventional transmission line. Using this concept, 
impedance can be defined and power appears as the product of voltage and 
current. Many other useful theorems in conventional ac circuits also become 
applicable to waveguide systems. 

In our category of microwave circuits, we shall include any electromagnetic 
phenomena in the inside of a hollow region enclosed by conducting walls. 
We are especially interested in the study of their effect on the propagating 





modes of electromagnetic fields in waveguides connected to the hollow region. 
The actual wavelength is not important; it can be longer or shorter than the 
centimeter or millimeter ranges as long as the electromagnetic field is 
confined in a finite region. On the other hand, we shall not discuss waves in 
free space as in radio and in optics nor those along a dielectric rod or coated 
wires in open space. 

In Chapter 1, some of the topics in the conventional circuit theory are 
reviewed, which have particular importance in the theory of microwave 
circuits. These include the theory of transmission lines, bilinear transforma- 
tions, and power waves. Chapter 2 gives a review of vector analysis and the 
fundamental properties of electromagnetic fields to facilitate our later study. 
In Chapter 3, waveguides are discussed for the first time. Here, the eigenvalue 
problem is studied in detail including the completeness of eigenfunctions. 
Without the discussion of the completeness, the theory is only partially 
correct since no answer is given to certain fundamental questions such as 
why an exponential variation with distance is assumed for each mode and 
no other possible functional form is considered. Chapter 4 discusses resonant 
cavities using a similar eigenfunction approach. In Chapter 5, various 
properties of waveguide junctions are discussed using matrices. 

Matrices are introduced since the multitude of symbols and lengthy 
expressions, which would otherwise be necessary, could interfere with our 
concentration on the heart of a problem. Just as vectors do, matrices help 
in studying relatively complex problems, systematically. Chapter 6 discusses 
two phenomena : One is the coupling between traveling waves and the other 
is that between electromagnetic fields in cavities. Each illustrates an in- 
teresting application of eigenvalue problems. Chapter 7 is a study of linear 
amplifiers. The central topic is their noise performance. In Chapter 8, we 
discuss a circuit-theoretical analysis of electron beams which clarifies a 
number of properties of practical interest. Finally, Chapter 9 contains a 
discussion of oscillators. Although oscillators are inherently nonlinear, some 
of their important properties are studied here without specifying the details 
of the nonlinearity. 

Each chapter is followed by a set containing two types of problems: one 
helps to give deeper understanding of the discussion in the text while the 
other covers topics which could not be included in the text. The reader is 
advised to at least read each problem even when he has no time to solve it. 

Obviously, this book is not intended to supply microwave circuit design 
data. Those who want such information should refer to a large collection 
of books already available on the market (for example, MIT Radiation 
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Laboratory series). The objective of this book is to convey to the reader, 
through the discussion of the theory of microwave circuits, some basic ideas 
and a few mathematical techniques, which are found to be useful in many 
branches of engineering and science. 

In a classroom, since only a few percent of the students will be involved 
with waveguides and related subjects in the future, there is no reason why 
details of presently available design techniques should be presented since 
these will soon be obsolete. On the other hand, b?sic ideas will ultimately 
prove to be more useful for solving problems in other fields in which the 
student may become involved. As a textbook, however, it is not necessary 
to follow each chapter closely. For example, one could start from Chapter 
3 depending on the background of the average student. Furthermore, the 
teacher does not need to cover the eigenfunction completeness discussion in 
the classroom; he has only to stimulate the students’ interest. They may read 
the appropriate parts by themselves or refer to them when a similar problem 
interest them. Similarly, many other parts can be bypassed in the classroom 
or presented in a simplified form by restricting the discussion to a special 
case; for example, the reference impendances can be made equal to 1 in the 
scattering matrices of Chapter 5. 

The current volume is based on a first-year graduate text written in Japan- 
ese and published by the Maruzen Company in 1 963. A number of revisions and 
additions were made before the book was translated in this final form. The 
author’s cordial thanks are due to the publishing company for allowing 
him to use material freely for publication. 

The English text has been read by many colleagues at Bell Telephone 
Laboratories. Among them, the author is particularly indebted to Dr. 
M. R. Barber who accepted the burden of correcting the English. Without 
his help, this book would not have come into existence. 
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CHAPTER 1 

ELEMENTS OF 

CONVENTIONAL CIRCUIT THEORY 



This chapter reviews several topics in conventional ac circuit theory 
which are of particular importance in the theory of microwave circuits. 
They are the theory of transmission lines, Smith chart, bilinear transfor- 
mations, and power waves. In the discussion of the transmission lines, 
special emphasis is given to explaining a standard approach in natural 
science in which an idealized model in one sense or another is constructed 
and studied in detail. Not only in microwave circuits but also in almost 
every branch of physics, it is important to construct an appropriate model 
and to study its behavior in order to obtain a better understanding of the 
actual situation, which is always far more complicated and difficult to study. 
Another technique explained in connection with the transmission line theory 
is that a change in viewpoint of a problem sometimes makes the solution 
considerably easier. A mathematical operation called linear transformation 
is one of the methods to change a viewpoint. Since the same kind of oper- 
ation will be repeatedly employed later in this book, use is explicitly made 
of a linear transformation, and its meaning and merits are explained. From 
such a discussion, it will become evident that a problem can become easy or 
difficult to understand depending on the way it is approached. It is, therefore, 
of utmost importance for us to keep a flexible attitude and to study various 
approaches in order to find the simplest techniques. 



1.1 Transmission Line Theory 

When current i flows through an inductance L 0 , the energy stored in the 
inductance is magnetic and given by \L 0 i 2 . Conversely, if a circuit element 
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stores magnetic energy when current i is flowing through it, the element has 
an inductance given by the stored magnetic energy divided by \i 2 . Similarly, 
if voltage v is applied to a capacitance C 0 , the electric energy stored in the 
capacitance is given by \C 0 v 2 . Therefore, the stored electric energy divided 
by gives the capacitance. 

When direct current / is flowing through a thin wire situated above a 
conductor ground plane to form a transmission line as shown in Fig. 1.1, 
there is magnetic field surrounding the wire and, hence, some stored mag- 
netic energy. Except for a small fringing effect at both ends, the magnetic 
field and, hence, the magnetic energy density is independent of longitudinal 
position z in Fig. 1.1. Thus, the magnetic energy stored per unit length of 




Fig. 1.1. Transmission line above ground plane. 



the line is given by the total stored energy divided by the total length of the 
line. If the energy stored per unit length is further divided by \i 2 , then it 
should give the inductance L per unit length of the line. On the other hand, 
if static voltage v is applied between the wire and the ground plane, an 
electric field appears between them. This means that some electric energy 
is stored in the space. In a similar way to the above, it is possible to define 
the electric energy stored per unit length of the line and, hence, the capac- 
itance C per unit length of the line. Thus, L and C defined are constants 
which are independent of z. 

Next, let us consider the case in which a slowly varying voltage is applied 
to one end of the line instead of static voltage, as before. In this case, one 
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might expect the voltage and current on the transmission line to be functions 
of time t as well as of z. Let us first derive the relations between the voltage 
and current at z and z + Az where Az is a small increment in z. Since the 
inductance per unit length of the line is given by L, a length Az of the line 
has an inductance L Az. The current i(z, t ) flowing into this inductance 
produces potential difference L Az {di(z, t)/dt} between z and z -y Az. The 
current also changes with z. In this calculation, however, it is assumed that 
i(z, /) can represent the current at any point between z and z + Az inclusive 
since Az is small and later it will be shrunk to zero. The partial differentiation 
means that the differentiation is with respect to t alone while the current is a 
function of both z and t. Since the sum of the voltage at z + Az and the 
potential difference between the terminals of the inductance L Az has to be 
equal to the voltage at z, we have 

_ di(z, t) 

v (z, t) = v(z + Az, t) + L Az — - — 

ot 

or equivalently, 

— L Az — — ^ — = v(z + Az, t ) — v(z, t ) (1.1) 

ot 

The difference in the currents at z and z + Az is attributable to the current 
flowing through the shunt capacitance C Az of the length Az of the line. 
The shunt current is given by C Az{dv(z, t)jdt), hence, we have 



i (z, t) = C Az ^ ^ + i(z + Az, t) 

dt 

or equivalently 

-C Az dV ^’ ^ = i(z + Az, t ) - (z, /) (1.2) 

at 



Dividing (1.1) and (1.2) by Az and taking the limits of Az ->■(), we have 



and 



di (z, Q _ dv{z, t) 
dt dz ■ 

c dv ( z > 0 = 

dt dz 



(1.3) 

(1.4) 



respectively. By differentiating (1.3) with respect to z and (1.4) with respect 
to t, i(z, t) can be eliminated. The order of differentiations with respect to t 
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and z can be reversed without changing the final result. Thus, we obtain a 
differential equation for v(z, t ) alone: 



d 2 v(z,t) d 2 v(z,t) 
“ dz 2 



(1.5) 



The problem is now reduced to the solution of this equation with appropriate 
boundary conditions. However, before trying to solve the differential equa- 
tion, let us consider what assumption have been made. There is an endless 
list of neglected phenomena, but some of the more important ones are as 
follows. First, the effect of resistances in the wire as well as in the ground 
plane is completely neglected. The potential difference between z and z + Az 
is therefore assumed to be attributable to the inductance L Az alone. Next, 
if the applied voltage is a function of time, the distribution of current 
density in the cross section of the wire changes from that for the dc case. 
The inductance per unit length of the line is therefore different from the dc 
value, but L is assumed to be a constant in our discussion. Furthermore, 
when the voltage between conductors becomes sufficiently large, corona 
discharge and sparks generally take place, and the shunt current can no 
longer be approximated by CAz{dv(z, t)/dt}. Finally, the complicated 
effect of fringing fields at the ends of the line are ignored. 

Any phenomenon that actually takes place is always too complicated to 
deal with exactly. In order to study it, therefore, a number of things which 
hopefully produce only minor effects are neglected, and an idealized model 
is constructed. The phenomenon which takes place in this simplified model 
is then studied in detail, and the result is compared with the experimentally 
observed behavior in order to understand actual effects and their causes. 
Often the first model may not fully explain the actual phenomenon. In such 
cases, another model has to be constructed which includes one or several 
factors previously neglected, therefore, it is closer to the actual situation. 
Sometimes, the same procedure may have to be repeated several times before 
a satisfactory result is obtained. Selecting an appropriate model constitutes 
the most important part of any study. If the model is too complicated, a final 
conclusion may not be obtainable. On the other hand, if it is too simplified, 
the model may not explain the actual phenomenon at all. Thus, the success 
or failure of a theory is primarily determined by the selection of an appro- 
priate model. 

Now, returning to (1.3)-(1.5), they are the equations which the voltage 
and current of the idealized model must satisfy. The simplicity of these 
equations is attributable to the fact that we ignored many, but hopefully 
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minor, factors in the derivation as explained before. As far as the voltage 
and current of the model are concerned, we have only to consider these 
equations. Therefore, let us forget the actual transmission line for the time 
being, keeping in mind, however, that if the conclusion from these equations 
fails to explain the actual phenomenon, the actual transmission line must be 
reconsidered to obtain an improved model. 

The differential equation (1.5) is linear: (i) If some voltage v satisfies the 
differential equation, then a constant times v, Av, also satisfies the same 
equation; and (ii) If n, and v 2 satisfy the differential equation separately, 
then their sum v t + v 2 also satisfies the equation. It follows from (i) that if 
the voltage applied to the model transmission line is increased by a certain 
factor, the voltage at every point along the line will be increased by the same 
factor, (ii) asserts that the principle of superposition holds for the model. 
For instance, when the applied voltage has a complex waveform, it can be 
decomposed into simpler components, and the line voltage can be calculated 
for each separately. The line voltage corresponding to the original complicated 
applied voltage can then be obtained by summing the results thus obtained. 
Another important point in connection with (1.5) is that the coefficient of 
the differential equation is real, i.e., the constant LC is not a complex 
quantity but rather a real quantity. 

Since Eq. (1.5) is a linear differential equation with real coefficients, we 
can use a special function of time, e i<0 '. Suppose v l = V(z) e JV>t satisfies the 
differential equation, then the complex conjugate v 2 = V*(z)e~ ib>i is also 
seen to satisfy the same differential equation since LC is real. Because of 
the linearity, a constant times their sum as well as their difference will also 
satisfy (1.5): 

V2{F(z) + V*(z) e~ J<ot }l2 = Re(V2 V(z) e J<ot ) 

-j y /l{V(z)e‘ M -V(z)e-^}l2 = im{ y /l F(z)e J “} 

where the asterisk (*) indicates the complex conjugate, and Re and Im 
indicate the real and imaginary parts of the quantity following. The quantity 
yj 2 is introduced in front of V{z) so that V{z) represents the effective value 
as is familiar in the ac circuit theory. The above two quantities are both real 
functions of time, and they can be interpreted as the actual voltages which 
are measurable on the model transmission line. Therefore, let us agree to 
take either the real or the imaginary part of yj 2 e iu>t times V(z) for the 
interpretation of V(z). Since ct> is arbitrary, by the theory of Fourier 
integral, the result can be integrated with respect to co with a proper weighting 
function to obtain the line voltage corresponding to any time-varying 
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applied voltage. For this reason, we shall use e* 0 * as the time factor unless 
otherwise specified. With the time factor e j ° >l , the differentiation with respect 
to time becomes equivalent to the multiplication by jo thereby simplifying 
the calculation. If sin cot is adopted as the time factor, the differentiation 
will produce a term like co x cos o)t and the calculation will not be as simple 
as in the case in which e J(at is used. 

Substituting v = V(z) e jM into (1.5) and dividing both sides by e 701 ', we 
obtain the differential equation for V(z): 

-co 2 LCF(z) = ^l ) (1.6) 

Since V(z) is not a function of t, the partial derivative with respect to z is 
changed to an ordinary derivative. Since (1.6) is a linear ordinary differential 
equation, it has a solution of the form 

V (z) — Ae~ yz (1.7) 

where A and y are constants. The substitution of (1.7) into (1.6) gives the 
condition for (1.7) to be a solution, 

y 2 = - o 2 LC 

y = ±MLC)‘ ll = ±jli (1.8) 

Here, in order to eliminate possible confusion about the sign of square 
roots, let us agree that ( ) l/2 expresses a positive value. The negative root is 
expressed by — () 1/2 . The constant y is called the propagation constant, 
and ft the phase constant. 

From (1.7) and (1.8), it follows that 

V(z) = Ae~ Jfiz + Be JPz (1.9) 

is also a solution of (1.6). A and B are constants to be determined by 
boundary conditions. Since (1.6) is a second order ordinary differential 
equation and (1.9) has two constants to be determined, this is the most 
general form of the solution and no other functions have to be examined. 
The proof of this assertion for our special case is not difficult. Let V(z) be 
any solution of (1.6) with nonzero y 2 = - <o 2 LC. We shall assume that A 
and B are not constant but that they are functions of z satisfying the follow- 
ing equations. 

V(z) = A(z)e~ 7Z + B(z) e yz 



V'(z) = A (z)(— ye~ yz ) + B(z) ye yz 



( 110 ) 

(l.H) 
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where the prime indicates the derivative with respect to z. Since the deter- 
minant of the coefficients of A(z) and B(z) is not equal to zero, it is always 
possible to solve for A(z) and B(z) for a given V{z). From (1.10), V'(z) is 
calculated as 

V (z) = A (z) ( - ye~ yz ) + B( z) ye 7Z + A' (z) e"** + B' (z) e vz 

Since the first two terms on the right-hand side represent F'(z) as given by 
(1.11), we have 

A' (z) e~ 7z -I- B'(z) e yz — 0 (1.12) 

From (1.11), V"(z ) can be calculated: 

V"(z) = A(z) y 2 e~ yz + B(z) yV z + A’ (z) (- ye~ yz ) + B' (z) ye 7Z 

The first two terms on the right-hand side represent y 2 V(z). However, since 
V(z) is a solution of the differential equation V"(z) = y 2 V(z), we must have 

A' (z) ( — ye' 72 ) + B' (z) ye yz = 0 (1.13) 

From (1.12) and (1.13), both A'(z ) and B'{z) are found to be zero. Thus, we 
conclude that any solution of the differential equation can be expressed in 
the form (1.9) with A and B being constant. 

From (1.4), it follows that i(z, t) can also have the time factor e J<ot . 
Therefore, let us write i(z, t) = /(z) e j<ot . Substituting this and (1.9) into (1.3) 
we have 

-jo)LI(z) = -]PAe~ jPz +jpBe jPz 

Using the relation ft = gj(LC) 1/2 , the above equation can be rewritten in 
the form 

l(z) = Zq 1 (Ae~ Jfiz — Be jfiz ) (1.14) 

where 

z 0 = (l/c) ,/2 = y 0 -‘ (1.15) 

Since coL Az has the dimension of impedance and oC Az that of admittance 
(where L = inductance per unit length), (L/C) 1/2 has the dimension of 
impedance. It is, therefore, indicated by Z 0 and called the characteristic 
impedance of the line, its inverse y 0 is called the characteristic admittance. 

It follows from (1.9) and (1.14) that the voltage and current on the line are 
completely determined when the two constants A and B are fixed. This means 
that the voltage and current at every point on the line are automatically 
determined if the following conditions are specified : 

(i) both voltage and current are given at one point along the line; 
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(ii) the voltages are given at two different points a distance / apart where 
fil # nn (n: integer); 

(iii) the currents at these two points are given; and 

(iv) the ratio of the voltage to the current at one point and either the 
voltage or current at another are specified. 

Since both A and B are complex numbers, four real numbers are specified 
in the above procedure. 

Up to this point, our discussion has been based on the voltage and current. 
Let us now introduce the following quantities 

a{z) = iZo 1/2 {V(z) + Z 0 1 (z)} , b(z) = \Z~ 0 X < 2 {V{z) - Z 0 /(z)} 

( 1 . 16 ) 

which are the result of a linear transformation applied to V(z) and l(z). 
With Z 0 being fixed, both a(z ) and b(z ) are determined if V(z) and /(z) are 
given. Conversely, if a(z ) and b{z) are given, from the inverse transformation 

K(z) = Zj l2 {a(z) + b(z)}, I(z) = Z^{a(z)-b(z)} (1.17) 

both V(z) and /(z) can be obtained. Consequently, there should be no 
difference in the result whether the transmission line is studied in terms of 
V(z ) and /(z) or in terms of a(z ) and b(z). However, since a(z) and b{z) 
can be shown to have physical meaning which helps in the understanding 
of transmission line phenomena, it is worthwhile to introduce them. Before 
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Fig. 1.2. An example of coordinate-transformations. 
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explaining the physical meaning, let us look at a familiar example in which 
a linear transformation facilitates our study. Referring to Fig. 1.2, various 
properties of an ellipse can be studied in terms of the coordinates (a, j>) as 
well as in terms of (a',/) which are obtained by a linear transformation of 
(a, y). Since everything that can be explained in terms of (a', y) can be 
explained in terms of (a, 7), the linear transformation is not entirely neces- 
sary in the study of the ellipse. However, since the coordinates (a', y') give a 
simpler expression for the ellipse, properties which are not clear in terms of 
(a, y) may be disclosed in terms of (a',/). For this reason (a \y') has an 
advantage over (a, y), and one should be familiar with this kind of trans- 
formation. 

Let us now substitute (1.9) and (1.14) into (1.16) and investigate the 
physical meaning of a(z) and b{z). The results are 

a{z) = Zo t,2 Ae~ Jfz t b{z) = Z Q x ' 2 Be m (1.18) 

a(z ) and b(z) are directly related to A and B, respectively. It is worth noting 
that the magnitudes of both a(z) and b(z) are kept constant along the line. 
Their phases change but the angles are directly proportional to z. Thus, the 
variations of a(z) and b{z) with z are considerably simpler than those of 
V(z) and /(z). 

In the lower frequency ranges, a piece of wire is generally used to connect 
circuit elements together, and in this wire the voltage and current are kept 
constant. Voltage and current are primarily used for the study of networks 
in which many elements are connected together by means of conducting 
wires because of this simple relation of constancy. In the higher frequency 
ranges, the connections are made through transmission lines. When their 
length can no longer be neglected, the voltage and current vary with z in the 
complicated manner shown in (1.9) and (1.14). It is natural, therefore, to 
look for substitutes which are the most simple functions of z as possible 
while providing the same amount of information as V(z) and /(z). In order 
to satisfy the last condition, quantities obtainable through a linear trans- 
formation of V(z) and 7(z) are considered, and a{z) and b(z), which have 
constant magnitudes and phases directly proportional to z, are chosen. 
Because of these simple relations, a(z) and b(z) will be found particularly 
suitable for the study of transmission line phenomena. 

For the interpretation of a(z ) when quantities are varying sinusoidally 
with time, we have to take the real or imaginary part of J 2 e i<ot times a{z) 
just as we did for the interpretation of K(z). Let us consider the real part. 
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Referring to Fig. 1.3, we obtain using (1.18) 



Re {yj2 a (z) e jiat ) = ^2 Z 0 ” 1/2 Re { Ae j(iat ~ pz) } 

= yJ2 Zq 1/2 \A\ cos(cur — pz + cp) 

where 



tan cp = Im {/1}/Re {A} 



(1.19) 

( 1 . 20 ) 



With fixed t, the right-hand side of (1.19) represents a sinusoidal waveform 
as shown in Fig. 1.4. The wavelength X is equal to 2njfi. If Az = (a>!P) At, 







Fig. 1.3. Relation between two complex vectors A and Ae^ a>t ~^ l K 



the arguments of the cosine are equal at (z, t) and (z + Az, t + At), as we 
can easily see from 

cot - pz + cp = c o(t + At) — fi{z + (cojl^) At] + cp 
= co(t + At) — jl(z + Az) + cp 

Since z is arbitrary, this means that after the time interval At, the function 
as a whole is translated toward the positive z-direction by the amount Az as 
shown by the dotted line in Fig. 1.4. The velocity of the translation is given 

by 

v p = ( AzjAt ) = (cu/^) (1.21) 

From these observations, we conclude that (1.19) represents a wave moving 




1.1. Transmission Line Theory *1 

toward the positive z-direction with a constant velocity v p which is called 
the phase velocity. In our case, since p = c o(LC) 1/2 , 

Vp = c olp = (LC)~ 112 (1.22) 

Thus, we see that a(z) is a wave moving in the positive z-direction with the 
phase velocity (TC) -I/2 . Similarly, b(z) represents a wave moving in the 




Fig. 1.4. Explanation for the wave motion of a{z). 



negative z-direction with the same phase velocity. For this reason, they are 
called traveling waves. 

Finally, let us calculate the net power at z flowing toward the positive 
z direction. Referring to Fig. 1.5, the power is given by 

P = Re { V (z) /* (z)} = Re [{a (z) + b (z)} (a* (z) - b* (z)}] 

= Re {\a (z) | 2 - |6(z)| 2 + a* (z) b (z) - a (z) b* (z)} 




Fig. 1 .5. Relation between V, I, and P. 
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where use is made of (1.17). Since a*(z) b(z) — a(z)b*(z) is the difference 
between a complex number and its conjugate, it is always imaginary. 
Therefore, we have 

/> = | fl (z)| 2 -[i,(z)| 2 (1.23) 

This result can be interpreted as follows. The wave a(z) moving in the 
positive z-direction carries the power |a(z)| 2 , and similarly the wave b(z) 
carries the power \b{z)\ 2 toward the negative z-direction. The net power 
toward the positive z-direction is therefore given by \a(z)\ 2 — \b(z)\ 2 . In 
other words, each wave defined above is accompanied by the power equal 
to the square of its magnitude. 

The advantages of introducing a(z ) and b(z ) will be further exploited in 
connection with the following discussion of the Smith chart. 



1.2 Smith Chart 

When we study electric circuits in terms of voltage and current, their ratio, 
i.e., impedance, proves to be a useful quantity. Since voltage and current 
have been replaced by a(z) and b(z), let us take the ratio of the latter 
quantities : 

r = b{z)/a(z ) = (B/A) e i2f>z (1.24) 

This is called the reflection coefficient. At a reference plane z, if a(z ) is 
considered to be an incident wave, b(z) represents the reflected wave, and 
r expresses the magnitude and phase of the reflected wave relative to the 
incident wave. In other words, r is the reflection corresponding to a unit- 
incident wave. When the reference plane z is shifted toward the positive 
z-direction with a constant velocity, r rotates counterclockwise in the com- 
plex plane with an uniform speed without changing its magnitude as we 
can see from (1.24). When z changes by n/f} = A/2, r completes one rotation. 
If there is no power source on the right-hand side of the reference plane z, 
the net power flow toward the positive z-direction must be positive and 
hence \a(z)\ 2 > \b(z)\ 2 from (1.23). This means that \r\ is smaller than unity 
in cases in which the transmission line is terminated by a passive circuit. 

Although in Eq. (1.9), the variation of the magnitude of the line voltage 
with z appears, at first, to be complicated, the use of r provides a considerable 
simplification. From (1.17), (1.18), and (1.24), we have 

|K(z)| = |Zj' 2 ( a (z) + b(z)}\ = Zj 12 |a(z)| |1 + r| = \A\ |1 + r| (1.25) 




uniformly rotates with z, the vector diagram shown in Fig. 1.6 clarifies how 
|F(z)| varies with z. The reflection coefficient r rotates once as z shifts by 
A/2 hence the variation of |F(z)| repeats itself every half wavelength as 
shown by the solid line in Fig. 1.7. Similarly, from (1.17), the magnitude 
of the line current is given by 

|/(z)| = \Zq 1/2 {a (z) - b(z)}\=Z 0 l/2 \a(z)\ |1 - r| 

= Z 0 “ 1 \A\ |l-r| (1.26) 




Fig. 1.6. Vectors 1 + r and 1 — r. 




\V(z)\ is, therefore, proportional to |l + r|. Since 1 is a constant and r 



Fig. 1.7. Relative amplitudes of voltage and current. 
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Referring to Fig. 1.6, it is clear that the magnitude of current changes with 
z as shown by the dotted line in Fig. 1.7. At the maximum point of \V(z)\, 
|/(z)| becomes minimum and vice versa. Except for the fact that they are 
shifted by 2/4 relative to each other, the two curves have identical shape. 

As we have explained in connection with (1. 1 6) and (1.17), effects which 
can be discussed in terms of voltage and current can be explained in terms 
of a(z ) and b(z). Consequently, concept of impedance may seem redundant 
once the reflection coefficient is introduced. Inasmuch as we are already 
familiar with the concept of impedance at low frequencies and because the 
series or parallel connection of circuit elements can be expressed by the 
simple addition of their impedances or admittances, respectively, it is advan- 
tageous to use freely both the reflection coefficient and impedance, switching 
from one quantity to another during a course of circuit study. For this pur- 
pose, let us clarify the relation between the reflection coefficient r and 
impedance Z. By definition, the impedance is given as the ratio of V(z ) to 
I(z). Therefore using (1.17), we have 

z _ ^( z ) _ z '° /2 fo ( z ) + fe (*)} = 7 1 + r 

I(z) Zq 1/2 {a{z) — b(z)} °1 -r 

or equivalently, 

Z/Z 0 = (1 + r)/(l - r) (1.27) 



The left-hand side of (1.27) is a dimensionless quantity representing the 
impedance Z relative to Z 0 . This is called the normalized impedance. 
Solving (1.27) for r, the reflection coefficient can be expressed in terms of the 
normalized impedance: 



(Z/Z 0 ) - 1 
(Z/Z 0 ) + 1 



(1.28) 



From (1.27) and (1.28), we see that there is a one-to-one correspondence 
between r and Z/Z 0 . If Z/Z 0 is plotted on an r plane or if r is plotted on 
the Z/Z 0 plane, when one is given, the other can be obtained conveniently 
from the chart. When the reference plane is shifted, a new reflection coeffi- 
cient can be obtained on the r plane simply by rotating the original vector r 
by the angle corresponding to the distance between the new and old reference 
planes. Such a simple operation is not available for obtaining the new 
Z/Z 0 on the Z/Z 0 plane. For this reason, a plot of Z/Z 0 on the r plane is 
preferred. Among the various ways to plot Z/Z 0 , the most common is to 
draw two sets of curves: constant resistance and constant reactance. The 
real and imaginary parts of Z/Z 0 corresponding to a given r can be read 
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on the chart from the two curves intersecting at the point representing r. 
The reflection coefficient plane on which the normalized impedance is 
plotted in this way is called a Smith chart. 

In order to construct the Smith chart, let us first consider the constant 
resistance locus on the r plane, i.e., the image on the r plane of a straight 
line parallel to the imaginary axis on the Z/Z 0 plane, as shown by K 0 in 
Fig. 1.8(a). The strategy, here, is to decompose the transformation (1.28) 





( 0 ) (b) (c) 

Fig. 1.8. Drawing of Smith chart (1). 



into several simpler ones and to study the steps one by one. Equation (1.28) 
can be rewritten in the form 



2 

(Z/Z 0 )+l 



(1.29) 



The first step is to determine {(Z/Z 0 ) + 1}. For this transformation, each 
point on K 0 is shifted to the right by a unit distance giving K t in Fig. 1.8(a). 
We now use the result that the inverse of a complex number is expressed on 
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1.2. Smith Chart 
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the complex plane by a vector with an inverse magnitude and having an 
angle which is the negative of the original one. The inverse of each point on 
can therefore be plotted, resulting in circle K 2 shown in Fig. 1.8(b). 
This is 1 /{(Z/Z 0 )+ 1}. The proof that the inverse of a straight line is a 
circle is given in Section 1.3. Multiplying K 2 by — 2, we obtain circle K 3 
which corresponds to the second term on the right-hand side of (1.29). 
By adding 1 to K 3 , i.e., shifting K 3 to the right by a unit distance, the locus 
on the r plane is obtained which corresponds to a constant resistance line 
on the Z/Z 0 plane. This is circle K A in Fig. 1.8(c). For different values of 
i?/Z 0 , the radius of the circle changes. However, regardless of the value of 
R[Z 0 , the circle is symmetrical with respect to the real axis and always 
passes through the point (l,y'0). 

Next, let us consider the constant reactance locus. The straight line C 0 
in Fig. 1.9(a) for which Xj Z 0 is constant coincides with itself when trans- 




formed to {(Z/Z 0 ) -l- 1} as indicated by C,. The inverse of C t is circle C 2 , 
shown in Fig. 1.9(b), which represents 1/{(Z/Z 0 ) + 1), and from this 
- 2/{(Z/Z 0 ) -I- 1} is given by C 3 in Fig. 1.9(b). Shifting C 3 to the right by a 
unit distance, the desired locus on the r plane is obtained as shown by C 4 
in Fig. 1.9(c). For different values of XjZ 0 , the radius of the circle varies. 
For a negative value of X/Z 0 , the circle appears below the real axis. However, 
regardless of the value of XjZ 0 , the circle passes through the point ( 1 , y 0) 
and is tangent to the real axis. 




Thus, two sets of circles corresponding to various values of RjZ 0 and 
XjR 0 can be drawn on the r plane. Excluding the region in which |rj > 1, we 
obtain the Smith chart shown in Fig. 1.10. The region where |r| > 1 is 
excluded because the transmission line is ordinarily terminated by a passive 
network for which |r| is less than unity, as we explained before. It follows 
from Fig. 1.8 that the excluded part in the r plane of Fig. 1.8(c) corresponds 
to the left-hand side of the imaginary axis of the Z/Z 0 plane which is shown 
in Fig. 1.8(a), i.e., to the region where the resistance is negative. In addition 
to these two sets of curves, the Smith chart, shown in Fig. 1.10, has scales 
around it showing angles in the corresponding distances in wavelenghts. 
These are convenient for obtaining the reflection coefficients at different 
points along the transmission line when the distance is measured in wave- 
lengths. 

To illustrate how to use the Smith chart, let us discuss the case in which 
a load impedance Z L is connected at the far end of a transmission line. First 
ZJZq is calculated and the point P corresponding to ZJZ 0 is located on the 
Smith chart. The vector OP with its tip at P and its tail at the origin 0 on 
the r plane gives the reflection coefficient r L at the reference plane where Z L 
is connected. Let OP in Fig. 1.11 be the vector r L . If the reference plane is 
shifted by k wavelengths toward the generator, i.e., toward the negative 
z-direction, the reflection coefficient vector r rotates clockwise by k on the 
scale around the Smith chart. The reflection coefficient at the new reference 
plane is given by OQ in Fig. 1.11, and Z/Z 0 corresponding to Q gives the 
normalized impedance looking into the load at the new reference plane. 
The actual impedance is then obtained by multiplying this value by Z 0 . As 
the reference plane shifts further toward the generator, r rotates, and at a 
certain point the vector r lies on the real axis. This means that the impedance 
looking into the load from this point becomes purely resistive. Further 
movement of the plane by A/4 gives another pure resistance with the nor- 
malized value being the inverse of the previous one. The magnitude of the 
line voltage is proportional to |1 +r| which is equal to the length of the 
vector drawn from the point r = — 1 +y0 to the tip of vector r. Note that 
r = — 1 +j0 corresponds to Z/Z 0 = 0 + jO. In the case of Fig. 1.11, as the 
reference plane moves toward the generator from the load, the voltage first 
increases. It takes the maximum value when r lies on the real axis and then 
decreases to the minimum point located a quarter wavelength from the 
maximum. A further shift increases the voltage again and thereafter repeats 
the cycle. 

The ratio of the maximum to the minimum voltage is called the standing 




IMPEDANCE OR ADMITTANCE COORDINATES 




Fig. 1.10. Smith chart. (Reproduced by permission from Kay Electric Company, Pine Brook, New Jersey.) 
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wave ratio (SWR). From Fig. 1.11, the standing wave ratio is given by 

SWR = (1.30) 

i - kJ 

where r L is the reflection coefficient of the load. A comparison of (1.27) 
with (1.30) shows that the standing wave ratio is given by the normalized 
impedance at the point where r is equal to |r L |. To obtain the value of 
standing wave ratio, therefore, draw a circle with center at the origin O and 
passing through the point P. It crosses the real axis at two points. Take the 
right-hand point and read the normalized resistance. This gives the desired 
standing wave ratio. 




Now suppose that the standing wave ratio and the distance from the | 
load to the first voltage minimum point are given and the load impedance | 
is to be calculated. To obtain the normalized impedance, first draw a circle | 
with center at the origin passing through the point whose normalized impe- 
dance is equal to the standing wave ratio. Next, measure the given distance 
counterclockwise from the (0 + jO) impedance point on the scale around the 
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Smith chart and draw a straight line from there to the origin. This follows 
since minimum voltages always occur at points whose position corresponds 
to Z/Z 0 = 0 + jO. The intersection of this line and the circle previously 
drawn gives the reflection coefficient and, hence, the normalized impedance 
of the load. In this way, the normalized impedance can be obtained once the 
SWR along a transmission line and the voltage minimum point have been 
measured. This is called the standing wave method and is one of the most 
commonly used methods of measuring high frequency impedances. Once the 
charateristic impedance is calculated or calibrated against some standard 
impedance, the unknown impedance can be determined. 

So far, the voltage and impedance have been almost exclusively used. 
Essentially the same procedure can be developed using current and admit- 
tance, but this will be omitted except for some comments concerning the 
following useful relations. The inverse of the normalized impedance is 
called the normalized admittance, since 

(z/z 0 ) _1 = z 0 /z = y/y 0 

Taking the inverse of (1.27), we have 

YfY 0 = (l-r)l(l+r) (1.31) 

A comparison of (1.27) with (1.31) shows that if r is replaced by — r, the 
normalized impedance is transformed into the normalized admittance. 
Since the normalized impedance is given at the point r on the Smith chart, 
the normalize admittance is obtained at the point — r. In other words, 
given a normalized impedance on the Smith chart, the normalized admittance 
is obtained at the opposite point with respect to the origin. This means that 
the same chart can be used for admittance as well as for impedance calcula- 
tions. The point (- l,y'0) on the r plane corresponds to the normalized 
impedance 0 + jO and also to the normalized admittance oo -fy'co. 

At this point, in order to become more familiar with the Smith chart, 
let us solve a particular problem. 

Exercise: Suppose that a 50 Q transmission line is terminated by a 30 ft 
resistor. By connecting a capacitor in shunt at an appropriate point along 
the line, eliminate the reflection toward the generator. 

Answer: The normalized impedance of the load is given by 

Z/Z 0 = (30 + ;0)/50 = 0.6 + jO 

Let us draw a circle with center at the origin passing through the point 
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whose normalized impedance is 0.6 + jO. The normalized impedance looking 
into the load at any point along the line should be found on this circle. 
The normalized admittance is also located on the same circle. We wish to 
eliminate the reflection by inserting a capacitor in shunt. To do this, on the 
same circle, we must find a point whose normalized admittance plus 
jb ( b > 0) is equal to 1 + jO, implying no reflection where b is the normalized 
susceptance of the capacitor to be determined. The desired point corresponds 
to the normalized admittance 1 — jb. Therefore, the point we are seeking is 
one of the intersections between the above circle and the unit conductance 
circle. Since b > 0, the imaginary part must be negative. Thus, the normal* 
ized admittance is determined on the Smith chart as 1 — y'0.5. The opposite 




Fig. 1.12. Example of matching circuit. 



point with respect to the origin gives the normalized impedance. On the scale 
around the Smith chart, this point is found to be about 0.103 wavelength 
away from the load. In other words, if a capacitor, whose normalized 
admittance is y'0.5, is inserted in shunt at a point 0.103 wavelength from the 
resistor, the reflection is canceled out and the generator sees a matched load. 
The normalized impedance of the capacitor is — j 2 and the impedance 
— y'lOOfi. The circuit configuration is shown in Fig. 1.12(a). The same 
normalized admittance j0.5 can also be obtained using an open-ended 50 Q 
transmission line about 0.073 wavelength long which is easily seen using the 
Smith chart. Thus, Figs. 1.12(a) and (b) are equivalent to each other. 

In the above discussion, the losses of the transmission line have been 
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completely neglected. However, it should be remembered that because of the 
resistance in the wire and in the ground plane and because of the possible 
dielectric loss in the medium, the magnitudes of the waves along an actual 
transmission line decrease as they progress. If the losses are taken into 
account, the propagation constant y which was defined in (1.7) becomes a 
complex quantity a +jfi and, at the same time, a nonzero imaginary part 
appears in Z 0 . The real part of y, a, is called the attenuation constant. When 
the losses are small, both a and the imaginary part of Z 0 are small and of 
the same order of magnitude. The effect of a, however, appears in the 
exponential form e ~ az , and it becomes larger as |z| increases; in other words, 
a is a measure of the loss per unit length. On the other hand, the effect of the 
imaginary part of Z 0 has no such enhancement with increasing \z\. For these 
reasons, Z 0 is generally assumed to be real, and y complex. In this case, 

r = (B/A) e 2az+j2ez (1.32) 

The locus of r, therefore, becomes a spiral instead of a circle, with a radius 
which increases with increasing z. If the reference plane is moved away from 
the load, [rj gets smaller. When the generator is sufficiently far away from 
the load, it sees no reflection regardless of the load impedance at the other 
end. If r is given at z lt and r is required at another point z 2 , the magnitude 
has to be multiplied by e 2a ( zj_z '> while the phase must be rotated by 
e j 2 p(z 1 -zi) p r0 vided that r is transformed in this manner, the normalized 
impedance as well as the admittance can be obtained in the same manner 
as in the previous lossless case. However, the magnitude of the voltage or 
current is no longer porportional to j 1 + r\ or 1 1 — r\, respectively, as z varies. 



1.3 Bilinear Transformations 

A complex variable w is described as being a bilinear transformation of 
another complex variable z when w is expressed in the form 



az + b 
cz +d 



(1.33) 



where a, b, c, and cl are constants. Solving (1.33) for z, we have 



— dw + b 

z = 

cw — a 



(1.34) 



This shows that when w is a bilinear transformation of z, z is also a bilinear 
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transformation of w. From (1.27) and (1.28), Z/Z 0 and r are bilinear trans- 
formations of each other. In this section, we shall discuss some of elementary 
properties of bilinear transformations and derive an equivalent circuit of 
two-port networks. Suppose that v is a bilinear transformation of w, i.e., 



ew + f 
gw + h 



(1.35) 



where e,fg, and h are constants. Since the substitution of (1.33) into 
(1.35) gives 

(ea + fc)z + ( eb + fd) 

(ga + he) z + (gb + hd ) 



v is a bilinear transformation of z. It follows from this that a bilinear 
transformation of a bilinear transformation is also a bilinear transformation. 

Next, let us prove the following important theorem of bilinear trans- 
formations: Any circle on the complex plane is transformed into a circle by 
a bilinear transformation. In this statement, a circle can be a straight line 
as the limiting case of increasing the radius and removing the center away 
from the origin. First, let us study a particular case in which the bilinear 
transformation w of z is simply the inverse of z and the circle on the z plane 
passes through the origin. Let D be the diameter and q> be the angle between 
the real axis and the diameter through the origin, as shown in Fig. 1.13. 
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Then, the circle is represented by 

z = D cos 6 e j(v+e) 



where 6 varies from - njl to ji/ 2. Taking the inverse, 



1 e J<p e }B e jv cos 6 —j sin 9 
z D cos 6 D cos 0 



e 

— (1 ~j tan0) 



When 0 varies, (1 — j tan 9)/D represents a straight line parallel to the 
imaginary axis and 1/Z) away from it. By rotating this straight line around 
the origin by - <p , the locus of 1 jz is obtained. Thus, we see that the inverse 
image of a circle passing through the origin is a straight line. Conversely, 
the inverse image of a straight line which does not pass through the origin 
is a circle passing through the origin. The circles K 2 in Fig. 1.8(b) and C 2 
in Fig. 1.9(b) are examples. 

Since the inverse of a complex number is obtained by taking the inverse of 
its magnitude and changing the sign of its angle, the inverse image of a 
straight line passing through the origin is a straight line symmetric to it with 
respect to the real axis. 

Next, let us consider the case in which the circle does not pass through 
the origin; 1/z can always be expressed in the form 



i _ m i 

z — kj(k + z) + 1 k 



(1.36) 



where k is a nonzero constant. This is easily proved by simplifying the right- 
hand side of the equation. When z represents a circle or a straight line which 
does not pass through the origin, k + z can be made to pass through the 
origin by choosing a proper nonzero constant k. Then, from the previous 
discussions, 1 f(k + z) becomes a straight line. A constant times a straight 
line is also a straight line, and the addition of a constant does not change 
the shape of the locus. Therefore, — k{(k + z) + 1 is a straight line. This 
line does not pass through the origin for the following reason. Assume to 
the contrary, then, there should be a point z which satisfies 



- k/(k + z) -|- 1 = zj(k + z) = 0 



This, however, requires that z = 0, i.e., the original locus passes through 
the origin contradicting the hypothesis. Thus, the straight line we obtained 
does not pass through the origin. The inverse of this line is a circle. Thus, the 
first term on the right-hand side of (1.36) is a circle and hence 1/z is also a 
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circle. In other words, the inverse image of a circle or a straight line which 
does not pass through the origin is a circle. Combining this with the results / 

obtained previously, we conclude that the inverse image of a circle or a 
straight line is a circle or a straight line depending on whether the original 
locus passes through the origin or not. 

We are now in a position to discuss the general case. In general, a bilinear 
transformation can be rewritten in the form 

az + b a b — (adjc) 

w = = - + 7 - (1-37) 

cz + d c cz + d \ 

where c is assumed not to be equal to zero. When c = 0, w is a simple linear 
function of z, and it is obvious that w represents a circle when z is a circle. 

Therefore, we shall concentrate on the case is which c 0. When z represents 
a circle or a straight line on the complex plane, cz + d also represents a 
circle or a straight line. Since the second term on the right-hand side of 
(1.37) is a constant times the inverse of cz + d, it is a circle or a straight line 
according to the previous discussion. Adding ajc, w is seen to be a circle 
or a straight line. This completes the proof of the theorem. : 

When a circle is transformed into a circle by a bilinear transformation, r 

there are two possible cases: (i) The inside of one circle is transformed to 
the inside of the other; and (ii) The inside of one circle corresponds to the 
outside of the other. In the first case, the rotational directions of the 




Fig. 1.14. The directions of rotation are the same when the inside of one circle 
corresponds to the inside of the other. 
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corresponding points on the circles are the same while in the second case, 
they are opposite to each other. The proof is as follows: Since any bilinear 
transformation can be decomposed into the inverse transformation, 
multiplications by a constant and additions of a constant, and since the last 
two operations do not change the rotational direction, we have only to 
discuss the inverse transformations. When the origin is outside of a circle 
as shown in Fig. 1.14, the transformed circle does not include the origin, 
and the rotational directions of the corresponding points on the circles 
are the same. This is obvious from the fact that the inverse of a complex 
number is represented by a complex vector whose magnitude is the inverse 
of the original magnitude and whose angle is the negative of the original 
angle. When the origin is inside of a circle as shown in Fig. 1.15, the trans- 




Fig. 1.15. The directions of rotation are opposite when the inside of one circle 
corresponds to the outside of the other. 

formed circle encloses the origin, and, since the inverse of the origin corre- 
sponds to infinity, it is obvious that the inside of one circle corresponds to 
the outside of the other. Furthermore, for the same reason used above, the 
rotational directions of the corresponding points on the circles become 
opposite. This completes the proof. From Figs. 1.14 and 1.15, it can be seen 
that corresponding points move in such a way that corresponding areas 
would appear on the same side if a person were to stand on each circle 
facing in the direction of the moving point. This statement is also true 
when one circle becomes a straight line as a limiting case. 

Next, let us derive an equivalent circuit of reciprocal two-port networks 
as an example of using belinear transformations. Suppose the relation be- 
tween the terminal voltages and currents of a given two-port network is 
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expressed by 

F 1 =Z 11 / 1 +Z 12 / 2 (1.38) 

V 2 = Z l2 I l+ Z 22 I 2 (1.39) 

where the polarities of the voltages and currents are as shown in Fig. 1.16. 
Because of the reciprocity of the network under consideration, the coefficient 
of J 2 in (1.38) is equal to that of I t in (1.39). We shall discuss the reciprocity 
of microwave circuits in detail in Chapter 5. 




I 2 

Fig. 1.16. Two-port network. 



If a load impedance Z L is connected to port 2 of the two-port network, 
V 2 is given by 

V 2 = " z L / 2 

Substituting this into (1.39), J 2 can be obtained in terms of J t . Then, sub- 
stituting this result into (1.38), we have 

K = {z„ -z? 2 (z L + z 22 )-‘}/i 

This means that when port 2 is terminated by Z L , the input impedance at 
port 1 becomes 

Zj = V\!h = {z n - z\ 2 (Z L + z 22 y 1 } (1.40) 

Thus, Z, is a bilinear transformation of Z L . When Z L changes from —j co 
to j<x) as shown in Fig. 1.17(a), the locus of Z { is a circle as shown in Fig. 
1.17(b). The circle is in the right-half plane with its internal area correspond- 
ing to the right-half plane of Z L for the following reason. Assume the 
contrary, then Z, can have a negative real part when Z L has a positive real 
part. However, this is impossible since the passive two-port network 
terminated by a load impedance with a positive real part is a passive circuit 
and the input impedance must have a positive real part. 
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Let Zj(l) = R { +jX j be the point with the smallest real part on the circle 
and let jxy be the corresponding Z L . Similarly, let Z,(2) = R 2 +jX 2 be the 
point with the largest real part and let jx 2 be the corresponding Z L . Finally, 
let jx 3 be a different point on the imaginary axis of Z L and let the corre- 
sponding point on Z, be as shown by (jx 3 ) in Fig. 1.17(b). When Z f traces 
out a circle, Z, - (Ry + jXy) represents a circle passing through the origin 
tangential to the imaginary axis as shown in Fig. 1.18(a). The other point at 
which the circle intersects the real axis is given by R 2 - Ry. The inverse of 
this circle, i.e. the admittance, becomes a constant conductance line with a 
value lj(R 2 — Ry) as shown in Fig. 1.18(b). When \/(R 2 — R 1 ) is sub- 
tracted, the admittance becomes a pure susceptance, jB. It is easily seen 
from Figs. 1.17 and 1.18 that the susceptance becomes zero when Z L is 
equal to jx 2 and infinite when Z L is equal to jx v An equivalent circuit for 
jB with these properties is shown in Fig. 1.19. The transformer ratio n is to 
be determined by the requirement that jB becomes jB 3 , shown in Fig. 1 . 1 8(b), 
when Z L becomes jx 3 . Since the admittance is increased n 2 times by the 
transformer, it follows that 

W (* 3 - Xi)}' 1 + {/'(*! - x 2 )} _1 ] " 2 = jB 3 
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Fig. 1.19. An equivalent circuit for jB = {Zf — ( Ri + jX i} -1 — {jfa 



or equivalently, 



B 3 (x 3 -x l )(x 1 -x 2 ) 



(1.41) 



When Zj( 3), shown by (jx 3 ) in Fig. 1.17(b), is located below the jX t level, 
B 3 is positive as shown in Fig. 1.18(b). 

From the discussion on the directions of corresponding moving points, 
it is clear that the point on the circle rotates clockwise when the corre- 
sponding point moves upward on the imaginary axis of the Z L plane.There- 
fore, only one of three cases is possible, depending on the location of the 
point corresponding to Z L =y'o o on the plane x x > x 3 > x 2 , x 3 > x 2 > x u 

x 2 > x i > x 3 . For each of these three cases, since B 3 > 0, Eq. (1.41) gives 
positive n 2 and, hence, the transformer ratio n is realizable. Similarly, if the 
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point (jx 3 ) is located above the jX 1 level, B 3 is negative and one of three 
inequalities, 

X 2 > X 3 > Xj, x 3 >x 1 >x 2 , x x > x 2 > x 3 

has to be satisfied. This means that n 2 is again positive and the transformer 
ratio is realizable. Thus, the circuit shown in Fig. 1.19 is realizable and gives 
correct values of jB at three different values of Z L = jx. 

It follows from the foregoing discussion that the circuit shown in Fig. 
1.20 gives the correct values of input impedance Z ( (l), Z ; (2), and Z^(3) for 
three different values of Z L Jx t Jx 2 , and jx 3 . Note that the impedance 
R t + jX j and conductance 1/(J? 2 — R-i) subtracted out to obtain jB are all 




Fig. 1.20. An equivlaent circuit for reciprocal two-port networks. 




restored in Fig. 1.20. Let us consider the two-port network inside the dotted 
line. This can be specified by equations similar to (1.38) and (1.39). Let 
Z' lu Z[ 2 , and Z' 12 be the coefficients corresponding to Z ll5 Z 12 , and Z 22 , 
respectively. Then, we have 

Z,(k) = {Z\ , - Z\\ (jx k + Z' 22 y 1 } (fc = 1. 2, 3) 

Multiplying both sides by jx k + a little manipulation gives 

Mz; 1 - Zfk) Z 22 + (Z\fZ' 22 - Z\\) = Z i {k)jx k ( k = 1, 2, 3) (1.42) 

This can be considered as three simultaneous linear equations for three 
unknown quantities Z[ u Z 22 , and Z[ y Z 22 — Z[\. When the equations are 
solved, these quantities are uniquely determined in terms of x k and Zfk). 
On the other hand, from the network defined by (1.38) and (1.39), three 
simultaneous equations are also obtained for Z n , Z 22 , and Z iy Z 22 - Z\ 2 
with identical coefficients to those in (1.42). Therefore, Z' xl> Z 22 , and 
z ii z 22 — z i 2 must be equal to Z u , Z 22 , and Z U Z 22 - Z\ 2 , respectively. 
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That is, 

Zn=Zi., Z 2 2 = Z22, Zi2 = Z'i2 (1-43) 

From (1.43) we see, except for the ambiguity in the sign of Z [ 2 , that Fig. 1.20 
is equivalent to the given network. The ambiguity in the sign of Z[ 2 can not 
be eliminated from the above discussion. However, since n 2 but not n itself 
is specified by (1.41), if the sign is opposite to that of the given network, 
Z[ 2 can be made equal to Z 12 by reversing the transformer polarity without 
disturbing other conditions. Thus, we conclude that the desired equivalent 
circuit is given by Fig. 1.20 with a proper choice of transformer polarity. 

When the network is lossless, the resistances in the equivalent circuit can 
be eliminated. Furthermore, if a certain length of transmission line is con- 
sidered part of the network, the equivalent circuit takes a particularly simple 
form. Suppose that a transmission line is connected at port 1 of the given 
network and that port 2 is open-circuited. Then, since the circuit is lossless, 
the incident and reflected waves a(z ) and b(z) must satisfy the conditions 
ja(z)| 2 — \b(z)\ 2 =0, | r\ = 1, and the standing wave ratio is infinite. Let us 
shift the reference plane (along the transmission line) to a voltage maximum 
point and consider the circuit consisting of the given two-port network and 
transmission line beyond this new reference plane, as shown by Fig. 1.21. 




Fig. 1.21. Composite circuit of two-port network and a length of transmission line. 

For this new two-port network, Zf = oo when Z L = oo. Therefore, the 
parallel reactance ./(xj — x 2 ) in Fig. 1.20 is also eliminated. The equivalent 
circuit becomes a series reactance, a transformer, and another series 
reactance. However, the series reactance on the left of the transformer can 
be transferred to the right after multiplying by n 2 . Therefore, the equivalent 
circuit of Fig. 1.21 becomes simply a transformer and a series reactance as 
shown in Fig. 1.22. Note that in addition to these two parameters, the 
length of the transmission line has been specified thus making a total of 
three independent parameters. To determine the transformer ratio n and the 
reactance X, the Z/s for two different Z L ’s other than Z L = oo are necessary. 
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jx 




I : n 

Fig. 1 .22. An equivalent circuit of Fig. 1 .2 1 . 

For example, let Z t be zero when Z L — — jx lt then X is equal to x v Further- 
more, if Z f = jX 0 when Z L = 0, n 2 is given by XfX 0 . 




1.4 Power Waves 

Let us consider a linear one-port network containing voltage and current 
sources. First suppose that a voltage source with the same magnitude as 
that of the open-circuited voltage at the terminals, but with the opposite 
phase, is inserted in series before connecting to a load. The effects of all the 
sources within the network and the added voltage source cancel each other 
and the circuit as a whole acts as a simple impedance Z g as far as the outside 
phenomena are concerned. Next, suppose that the sources inside the one- 
port network are all nullified and the phase of the added voltage source is 
reversed. The super-position of these two cases is equivalent to the original 
one-port network. As a result, any linear one-port network can be repre- 
sented by the series connection of Z g and a voltage source equal to the open- 
circuited terminal voltage. Therefore, in the following discussion we shall 
use Fig. 1.23 to represent an arbitrary linear one-port generator. Referring 




Fig. 1.23. An equivalent circuit of one-port generators. 
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to Fig. 1.23, let V 1 be the voltage across the load impedance Z L and I x be 
the current flowing through it. The power consumed in the load is given by 

Pl = Re {Z L } |/,l 2 

Since the magnitude of the current is \EJ(Z L + Z g )\, P L is rewritten in the 
form 



Pl = Rl 



Ex 2 = __ R l\Ex \ 2 

Z L + Z g (R L + Rg y+(X L + X g y 



= Ey4R 9 + 



(r l -r 0 ) 2 . (x L + x g y 



(1.44) 



(1.45) 



where R L and R g are the real parts of Z L and Z g , respectively and X L and 
X g are the imaginary parts. With R g > 0, we can easily see from (1.45), that 
the power becomes maximum when 



Rl = K> Xl = -X 9 



(1.46) 



This maximum power is called the available power of the generator and 
indicated by P a : 

P a = \E 1 \ 2 l4R g (R g > 0) (1.47) 



and P a is solely determined by the parameters of the generator, hence it is a 
characteristic of the generator independent of the load. In order to realize 
the maximum power consumption, (1.46) has to be satisfied. This condition, 
Z L = Z g *, is called the matching condition and if it is not satisfied, the load 
consumes less power than P a . 

When R g is negative, P L becomes infinite as R L and X L approach — R g 
and — X g , respectively, as can easily be seen from (1.44), therefore the 
available power is infinite if R g is negative. However, the power expressed 
by the right-hand side of (1.47) remains finite for any nonzero R g , and it is 
called the exchangeable power P e of the generator. That is, 

P. = \Eyi4R a (R,$ 0) (1.48) 

When R g is positive, P e is equal to the available power from the generator, 
however, if R g is negative, it is no longer the maximum power available. 
Since a small variation in Z L from Z* produces only a second order 
variation in the right-hand side of (1.45), P e can be considered as the sta- 
tionary value of P L with respect to a variation of Z L . Sometimes, P e is 
called the characteristic power of the generator, emphasizing that it is 
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invariant to nonsingular lossless transformations as we shall explain in 
Section 5.3 and again in Section 7.1. 

With this preparation, let us introduce power waves a x and b l which are 
defined by linear transformations of V x and /, : 

ai =i|ReZ s r" 2 (F 1 +Z,/ 1 ). 6 1 = i|ReZ,|- , ' 1 (K 1 -Z,*/ 1 ) (1.49) 

Note the similarity between these equations and the waves (1.16) introduced 
in Section 1.1. With a fixed Z g , and if V t and /, are given, then a x and b x 
can be readily calculated from (1.49). On the other hand, if a x and b x are 
given, V x and I t are obtained from the inverse transformation 



V t = Pl |ReZ ,r ,,2 (Z,*a l + Z„6,), /, = Pl |ReZ,r 1/2 (a 1 - ft,) 



where /Jj is defined by 



when Re Z g > 0 
when Re Z g < 0 



(1.50) 



(1.51) 



Thus, any result in terms of one set of variables can easily be converted to 
that in terms of the other set of variables. This justifies the use of a x and b x 
defined by (1.49) in place of the terminal voltage and current for any 
analysis. Referring to Fig. 1.23, V 1 is given by 



V l =E 1 ~ ZL 



From this and (1-49), we have 



which is equivalent to 



\a,\ z = \E 1 \ 2 l4\R g | 

P e = \a 1 \ 2 \R q \IR g = p 1 \a l \ 



(1.52) 



Next, let us consider the meaning of \a x \ 2 — \b x \ 2 . From (1.49), this becomes 

, ,2 ,, ,2 _ (Vx + Zgl^iV* + Z 9 *I*) - ( V t - Z*I t )(V* - Zgl*) 

,a, l _lil1 ■ 

= (Z . + Z , *)(V l I* + y*I l ) 

4| RJ 

= T7T, Re {Vih*} 



from which we have 



Re{F 1 / 1 *} = Pl (|« 1 | 2 -|6 1 | 2 ) 



(1.53) 



Since RelFj/!*} is the net power transferred from the generator to the 
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load, we see from (1.52) and (1.53) that the exchangeable power is given by 
Pj I a x \ 2 and the net power by p x (\a x \ 2 — \b x \ 2 ). This leads to the following 
interpretation: The generator sends the exchangeable power p x \a x \ 2 to the 
load, however, p t |6 t | 2 is reflected back to the generator, and hence the net 
power to the load is given by p x (|a,| 2 - |6i| 2 ), where a v and b x are the 
waves associated with these forward and reflected powers. Since \b x \ 2 is 
nonnegative, \a l \ 2 — \b x \ 2 becomes maximum when \b x \ 2 is equal to zero. 
Therefore, whether the load contains some power sources or not, the mag- 
nitude of the exchangeable power can be identified as the maximum power 
that the generator can supply when R g > 0 and the maximum power that 
the generator can absorb when R g < 0. In other words, the maximum power 
that the generator can exchange with the external circuit is the exchangeable 
power although the generator can absorb, or supply, more than jPJ when 
R g > 0 or R g < 0, respectively. 

Power waves a l and b 1 use the generator impedance in their definition. 
Similar waves a 2 and b 2 can be defined from the other side of the center 
line of the circuit in Fig. 1 .23, using the load impedance rather than generator 
impedance in the definition. In this case, the direction of the current has to 
be reversed. Thus, we have 

“2 = WeZiT m (y 2 + Z L l 2 ) = i|Re2J- , ' 2 (K 1 - ZJ,) 

b 2 = i|ReZ t n ' l2 (V 2 - Z’h) = i|ReZ l r ,,2 (F 1 + Z L *I,) (1.54) 

Note that a l and b x are not necessarily equal to b 2 and a 2 , respectively. 
Let p 2 be + 1 or — 1 when the sign of R eZ L is positive or negative, respec- 
tively. Then, p 2 \a 2 \ 2 represents the exchangeable power of the load. If the 
open-circuited terminal voltage of the load is zero, this is equal to zero. 
The net power transferred from the load to the generator is given by 
p 2 {\ a i\ 2 — 1^2 1 2 )- Therefore, the net power from the generator to the load is 
given by p 2 (\b 2 \ 2 — \a 2 \ 2 ). Although p t \b t \ 2 was considered as the power 
reflected from the load back to the generator, we shall call p 2 \b 2 \ 2 the 
actual power flowing into the load. Note that because of the voltage source 
in the generator, it can be a finite quantity even when the exchangeable 
power of the load is zero and hence the load does not send out any power to 
be reflected back. The actual power is equal to the net power transferred 
to the load plus the exchangeable power of the load, if any. Of course, 
p x \b x \ 2 could be called the actual power flowing into the generator. 

Let us next introduce the power wave reflection coefficient s x defined by 

5 i — (b l /a l ) (1.55) 






1.4. Power Waves 



37 




When the exchangeable power of the load is equal to zero, V x = Z L I X and s x 
can be expressed in terms of impedances: 

S l = (Z L ~ Zg*)l(Z L + Zg) (1.56) 

Substituting Z g = R g + jX g and Z L = R L + jX L into (1 .56), s x can be rewritten 
in the form 

R l +j(X L + X,) -f R, 

Comparing this expression with that of the reflection coefficient (1.28), 
we see that s x corresponds to the vector drawn from the center of the 
Smith chart to the point where the normalized impedance is given by 
{R l + j{X L + X g )}lR g . In other words, if the reactive part of Z g is added 
to Z L and normalized with respect to the real part of Z g , the correspond- 
ing point on the Smith chart gives the magnitude and phase of the power 
wave reflection coefficient. From this, the following important property of 
s x is derived: When R g and R L have the same sign, |s,| < 1, and when they 
have opposite signs, \s x \ > 1. 

The power reflection coefficient is given by 

\ Si \* = \(Zl-Z;)I(Z l + ZX (1.58) 

When the matching condition (1.46) is satisfied, the power reflection coeffi- 
cient becomes zero, as is expected. 

Looking into the load from the generator, s x and |s x | 2 are the reflection 
coefficients. The corresponding reflection coefficients s 2 and |.y 2 | 2 looking 
into the generator from the load are given by 

s 2 = (Z, - Z*)I(Z, + Z L ) , tel 2 = |(Z, - Z*)j(Z, + Z L f 

where the subscripts g and L are interchanged. The reflection coefficient s 2 
is not necessarily equal to s x . However, since \Z g — Z L *\ = \Z* — Z L \ 
— \Z L — Z g * |, |5 2 | 2 is always equal to l^l 2 . 

The quantity 1 - |s t | 2 is called the power transmission coefficient which is, 
of course, equal to 1 - |s 2 | 2 - It is worth noting that the power transmission 
coefficient multiplied by the exchangeable power of the generator is equal 
to the net power transferred to the load and conversely the net power 
divided by the power transmission coefficient is the exchangeable power. 
If Z g is replaced by positive real Z 0 , the power waves a x and b x defined 




38 



1. ELEMENTS OF CONVENTIONAL CIRCUIT THEORY 



Problems 
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by (1.49) become identical to the traveling waves a(z) and b(z ) given by 
(1.16). As a result, the various relations derived for power waves also hold 
for traveling waves provided that Z fl is replaced by positive real Z 0 . One 
example is the result that the power transmitted in the z-positive direction 
is given by \a{z)\ 2 - \b{z)\ 2 . The same relation is easily derived from (1.53) 
since is equal to plus one in that case. On the other hand, when the char- 
acteristic impedance is complex, the situation becomes slightly different, 
and \a(z)\ 2 - \b{z)\ 2 no longer gives the net power Re{P7*}. Furthermore, 
as we shall see in Chapter 2, when there is no reflection in terms of power 
waves, the traveling wave has some reflection from the load and vice versa. 

PROBLEMS 

1.1 Derive the transmission line equations corresponding to (1.3) and (1.4) and (1.5) 
taking into account the resistance of the wire. Then calculate the characteristic 
impedance and propagation constant of the line. 

1.2 Suppose that a length of 50 n transmission line is terminated by a 20 fi resistor 
and 30 pF capacitor connected in series and calculate the SWR and first voltage 
minimum point at 1 GHz using the Smith chart. 

1.3 Suppose that the SWR is 2.6 and the first voltage minimum point is 0.338 wavelengths 
away from a load terminating a 50 D transmission line. Calculate the impedance of 
the load on the Smith chart. 

1 .4 Suppose that a 50 Q transmission line is terminated by (1 5 + j\ 5) D and the incident 
power is 5 kW. Calculate the net power consumed in the termination. Also calculate 
the maximum voltage on the line. 

1.5 Calculate the inverse of 1.0 + /1.0 using the Smith chart. 

1.6 Convert the circuit shown in Fig. 1.24 to the equivalent circuit corresponding to 
Fig. 1.20. 



1.7 Prove that the exchangeable power is invariant to the insertion of ideal transformers 
or the connection of shunt and series reactances. 




Fig. 1.25. Pl as a function of Rl when R g > 0 and X g + Xl = 0. 

1.8 When Xl + X g is equal to zero and R g > 0, Pl varies with Rl as shown in Fig. 1 .25. 
Investigate the case in which < 0 and Xl + X g is not necessarily equal to zero. 
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Fig. 1.24. A two-port network. 
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CHAPTER 2 

ELECTROMAGNETIC 
FIELD VECTORS 



In order to discuss microwave circuits, Maxwell’s equations must be 
studied since these describe the relations between electric and magnetic 
fields. If the concept of a vector is introduced, the relations between the 
fields become simpler to describe and easier to understand. Therefore, we 
shall make extensive use of vectors in this book. This chapter reviews some 
of the important theorems on vector analysis which will facilitate our later 
study. In Section 2.1, elements of vectors and vector analysis are presented, 
and in Section 2.2, Maxwell’s equations are explained in terms of the field 
vectors in order to refresh the reader’s understanding of electromagnetic 
theory. Section 2.3 gives an analysis of plane waves, first in terms of scalar 
quantities resolving the field vectors into their components, and then in 
terms of vectors after which the results are compared. 



2.1 Vectors 

A vector is a quantity with magnitude and direction. To visualize it, we 
usually consider an arrow whose length and direction expresses the magnitude 
and direction of the vector, respectively, as shown in Fig. 2.1. The space of 
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Fig. 2.1. Arrow representing vector A. 
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this figure is not the actual space in which we live, but rather it is an abstract 
space where the magnitude and direction of the vector is represented by the 
length and direction of the corresponding arrow. Vectors are generally 
functions of time and position existing in actual space, therefore, depending 
on the position and time, the lengths and the directions of the vector arrows 
vary in abstract space. Sometimes however, the tail of the arrow is located 
at the point in the actual space where the vector is considered. An example 
will be shown in Fig. 2.10. In such a case, the illustrated space has two 
meanings; one is the actual space, and the other is the abstract space in 
which we consider the arrows representing vectors. Vectors representing the 
velocity of liquid, electric field, or magnetic field, which are functions of 
position, are sometimes represented by a cluster of curves with arrow heads. 
In this case, the direction of the vector at each point on a curve is given by 
the tangent in the direction indicated by the arrow. The magnitude of the 
vector at a particular point is given by the density of curves in the vicinity 
of that point. 

When the length of an arrow shrinks to zero, its direction loses meaning. 
The corresponding vector is called a zero vector and is indicated by 0. 
The vector which has the same direction as a vector A, but with a magnitude 
k times as large, is indicated by kX. The addition of two vectors A and B is 
defined as follows. Place the tails of the corresponding arrows at the same 
point in abstract space as shown in Fig. 2.2 and consider the plane which 




Fig. 2.2. Addition of two vectors A and B. 

they define. On this plane construct a parallelogram with two of its sides 
coinciding with the arrows. Then, the arrow C corresponding to the diagonal 
as shown in Fig. 2.2 represents the vector A + B. Alternatively, translate B 
until its tail coincides with the tip of vector A. If an arrow C is now drawn 
from the tail of A to the tip of B, then C represents A + B. In these operations, 
it is immaterial whether A or B comes first, i.e., the addition is commutative. 
Thus, we have 
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A+B=B+A=C 



( 2 . 1 ) 
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Furthermore, note that A and B could be vectors defined at different times 
and different points in the actual space. 

Vectors with magnitudes equal to unity are called unit vectors. Let the 
unit vectors i x , i r and i z have the directions of the x, y, and z axes of a rec- 
tangular coordinate system. Then, as is easily seen from Fig. 2.3, an arbitrary 
vector A can be expressed as 

A = i x A x + i y A y + \ Z A Z (2.2) 




Fig. 2.3. Vector A and its rectangular components. 

A x , A y , and A z are called the y, and z components of vector A while i x A x 
is called the projection of A in the x-direction or the projection of A on i x . 

The scalar product A-B of two vectors A and B is defined as |A[ jBj cos $ 
where |A| and |B| represent the magnitudes of vectors A and B, respectively, 
and 6 is the angle between them. Whether A or B comes first, the scalar 
product remains the same: 

A*B = B*A = |A| |B| cos0 (2.3) 

The scalar product can be considered as the product of the length of A and 
the length of B', where B' is the projection of B on A as shown in Fig. 2.4. 



B 




Fig. 2.4. Explanation of scalar product. 
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If 6 is greater than 90°, the value of the scalar product becomes negative. 

The scalar product of A and the addition of two vectors B and C is equal 
to the addition of the scalar product of A and B plus the scalar product of 
A and C. That is, 

A.(B + C) = A.B + A-C (2.4) 

This is proved as follows. The left-hand side of (2.4) is equal to the length of 
A multiplied by the length of the projection (B + C)' of (B + C) on A. The 
right-hand side is equal to the length of A times the length of the projection 
B' of B on A plus the length of A times the length of the projection C' of C 
on A. Referring to Fig. 2.5, however, we see that the projection (B + C)' is 




Fig. 2.5. Scalar product of a vector and the sum of two vectors. 



equal to the addition of the projection B' and C'. Thus, we have 

A • (B + C) = A • (B + C)' = |A| |(B + C)'| = |A| {|B'| + |C'|} 

= A-B'+A-C'=A.B + A-C (2.5) 

This completes the proof. From (2.5) and the relations i x -i x = 1, i x *i y = 0, 
etc., the scalar product of A and B can be expressed in terms of the x, y, z 
components as follows: 



A • B = (i x A x + \ y A y + i z A z ) • (i x B x + i y B y + i Z B Z ) = A X B X + A y B y + A Z B Z 



( 2 . 6 ) 



The vector product A x B of two vectors A and B is a vector defined as 
follows. The magnitude of the vector is equal to the area of parallelogram 
defined by A and B while its direction is normal to the plane of the parallelo- 
gram and given by the right-hand screw rule as shown in Fig. 2.6. A screw 
would advance in the direction of the vector when turned from A to B 
through the smaller angle 0 as shown. Referring to Fig. 2.6, the magnitude 
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Fig. 2.6. Explanation of vector product. 



of A x B is equal to |A| |B| sin0. The vector product B x A has the same 
magnitude as A x B. However, since the right-handed screw advances in the 
opposite direction when it is turned from B to A, the direction of B x A is 
opposite to the direction of A x B. That is, 

BxA = -AxB (2.7) 

If we draw a straight line parallel to A passing through the tip of B and take 
an arbitrary vector drawn from the tail of A to this line, then the vector 
product of this new vector and A is equal to A x B. Let B' the be one normal 
to A as shown in Fig. 2.6. Then, of course, we have 

A x B = A x B' (2.8) 

Suppose that one of the vectors in a vector product, say the second one, 
is expressed as an addition of two vectors B and C. Then, 

Ax(B + C) = AxB + AxC (2.9) 

The proof is as follows. In Fig. 2.7, dotted lines are drawn from the tips of 
vectors B, C, and B + C parallel with A, and the projections of these vectors 
on to the plane perpendicular to A are indicated by B', C', and (B + C)', 
respectively. From (2.8), we have 

A x (B + C) = A x (B + C)' 

A x B = A x B' (2.10) 

AxC=AxC' 

These vector products lie in the plane perpendicular to A since they are all 
normal to A. They are obtained by rotating (B + C)', B', and C' by 90° 
around A and multiplying the lengths by a factor |A|, respectively. From 
Fig. 2.7, we see that the projection of B + C to a plane is equal to the 
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addition of the projections of B and C to the same plane, i.e., 

(B + C)' = B + C 

Since neither the rotation of each vector in the same direction by the same 
angle nor the multiplication of its length by the same factor does change the 
additive relation between the vectors, we have 

Ax(B + C)'=AxB'+AxC' 

The substitution of (2.10) gives (2.9). 

The repeated use of (2.9) and the relations i* x i* = 0, i* x i y = i z , etc., 
gives A x B in terms of the rectangular components of A and B as follows: 

A x B = (i x A x + i y A y + i Z A Z ) x (i^B* + i y B y + i z £ z ) 

= i x (A y B z - A.B y ) + i y (A 2 B x - A X B Z ) + i z (A x B y - A y B x ) (2.1 1) 




Fig. 2.7. Vector product of a vector and the sum of two vectors. 



Treating i*, i^, i z as if they were ordinary numbers, the above formula can 
be expressed as the determinant 



i ** 

A x B = A x 
B x 

i x 



•y 

A y 




( 2 . 12 ) 



The vector product of two vectors B and C, namely, B x C, is itself a 
vector. We can, therefore, consider the scalar product with another vector 
A, i.e., A*(B x C). Referring to Fig. 2.8, B x C has a magnitude which is 
equal to the area of the parallelogram defined by B and C and a direction 
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77 
■ / 




B 



Fig. 2.8. Scalar product of a vector and the vector product of two vectors. 

normal to the plane of the parallelogram. Since the magnitude ofA-(BxC) 
is equal to the magnitude of (B x C) times the length of the projection of 
A on (B x C), it is equal to the volume, or the base area times the height, 
of the parallelepiped defined by A, B, and C. This volume remains the same 
whether the parallelogram defined by A and B or the one defined by C and A 
is considered as the base. Thus, we have 

A *(B xC) = C*(Ax B) = B-(C x A) (2.13) 

However, if C x B is taken instead of B x C, the direction of the vector is 
opposite and the volume becomes negative. 

In a similar way to the preceding discussion, we can consider the vector 
product of A and (B x C). This is represented by A x (B x C) where the 
parentheses indicate that the inside operation must be carried out first. 
Since it is perpendicular to B x C, which is in turn normal to the plane defined 
by B and C, A x (B x C) lies in this plane. This means that A x (B x C) is 
expressible as a linear combination of B and C, i.e., the sum of some coeffi- 
cients times B and C. These coefficients are found to be (A*C) and 
- (A*B), respectively. That is, 

A x (B x C) = (A ♦ C) B — (A • B) C (2.14) 

Since this formula will be used several times, we shall prove it as follows. 
First consider the special case in which A is equal to B. In Fig. 2.9, a dotted 
line parallel to B is drawn passing through the tip of C, and C' is drawn 
perpendicular to B from the tail of B to the dotted line. From (2.8), B x C 
is equal to B x C'. The vector C - C, drawn from the tip of C to that of C\ 
is parallel to B but the direction is opposite. Its magnitude is equal to that 
of the projection of C on B, which is given by |C| cos 9 where 9 is the angle 
between B and C. Since this magnitude can be expressed as (|B| |C| cos 9/ 
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|B| 2 ) |B|, C' - C is equal to - {(B-C)/(B-B)} B. From this, we have 

C = C-B(BC)/(BB) (2.15) 

Noting that B x (B x C') is perpendicular to both B and B x C', it follows 
from Fig. 2.9 that the direction of the vector B x (B x C') is parallel but 
opposite to C' and its magnitude is given by |B| (|B| jC'|). Therefore, we 
have 

B x (B x C') = — (B • B) C 





Substituting (2.15) in the right-hand side and using the fact that B x C' is 
equal to B x C, the desired result 

B x (B x C) = (B • C) B — (B • B) C (2.16) 

is obtained. This shows that (2.14) is correct in this special case where A = B. 

Let us now consider the general case in which A is not necessarily equal 
to B. From the explanation preceding (2.14), A x (B x C) can be written 
in the form 

A x (B x C) = kjB + k 2 C (2.17) 

We must, therefore, determine the coefficients k i and k 2 . To do so, first 
multiply (2.17) by A* from the left, then we have 

A -A x (B x C) = k, (A ♦ B) + k 2 (A • C) 

The left-hand side of this equation represents the volume of parallelepiped 
defined by A, A, and B x C. However, since the two edges coincide with 
each other, the volume is zero. Setting the right-hand side equal to zero, we 
obtain 



k 2 = -&! (A • B)/(A • C) 
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Substituting into (2.17), we have 

A x (B x C) = ic, {B - C(A-B)/(A *C)} (2.18) 

Multiplying (2.18) by B*, we have 

B • A x (B x C) = k t {(B • B) - (B • C) (A • B)/(A • C)} (2.19) 

Considering (B x C) as the third vector in the left-hand side of (2.13), the 
left-hand side of (2.19) is seen to be equal to — A*B x (B x C). Therefore, 
applying (2.16), we obtain 

B A x (B x C) = -A B x (B x C) = - A {(B • C) B - (B B) C} 

= (B • B) (A • C) — ( A • B) (B • C) 

Comparing this with (2.19), we see that 

— (A • C) 

Finally, substitution into (2.18) gives the desired relation (2.14). 

Let us now study the differentiations of vectors. There are two different 
kinds of differentiation, divergence and rotation, just as there are two 
multiplications, scalar product and vector product. The divergence (V*A: 
del dot A) is defined by 

V *A = lim | A n dS (2.20) 

av-o AV J AS 

where the integral is over the closed surface of a small volume element A V 
and n is the outer normal unit vector, i.e., the unit vector normal to the 
surface and pointing outward. If A represents the velocity of incompressible 
liquid, since the integral gives the amount of the liquid emerging from the 
closed surface, V *A represents the amount of the liquid generated per unit 
volume. The divergence V*A is a scalar quantity which has no direction 
associated with it. The integration of V *A over a certain volume V is 
equivalent to the summation of V*A AV over the same volume. Because 
of the definition, V-A AV can be written in the form of a surface integral. 
Since the outflow through a surface from A V means the inflow through the 
same surface for the adjacent A V , when summing the surface integrals, the 
contributions from such common surfaces cancel each other. Thus, there 
remains only the contribution from the outermost surface which is not 
shared by the neighboring volume elements : 

j* V-A dv = j A-n dS 



( 2 . 21 ) 
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where 5 is the closed surface of the volume V and n is the outer normal unit 
vector as shown in Fig. 2.10. This is called Gauss’s theorem. 

Let us consider how to express the divergence in terms of the rectangular 
components A x , A y , and A, of the vector A. The shape of AV is irrelevant 
to the final result, however, to avoid unnecessary complication, let AV be a 
cube whose edges are parallel to the .v, y, and z axes as shown in Fig. 2.11. 




x 

Fig. 2.11. Volume element for divergence calculation. 



Let (x, y, z) be the center of the cube. The contribution to the surface integral 
from the two surfaces perpendicular to the xr-axis is given by 

j A‘ndS + j A*n dS 

= A x (x + \Ax, y, z) Ay Az — A x (x — }Ax, y, z) Ay Az 
= { A x( x , y> z ) + ±Ax(dAjdx)}AyAz - {A x (x, y, z) - $ Ax(dAJdx)}Ay Az 
j f = (dAJdx) Ax Ay Az 

i i 
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Similarly, the contributions from other surfaces can be calculated, and when 
all are added, we have 

f . ... (dA r dA u dA,) 



f * \ dA x dA dAA 

]j- DdS = [^ + Jy- + -^\ AX/,yAz 

Dividing by AV = Ax Ay Az and taking the limit of AV-* 0, we obtain 



v. A JA + B A + d A 

dx dy dz 



Let V be an operator defined by 



d . d d 
* ~ Ix dx + ' y dy + ' lz dz 



( 2 . 22 ) 



(2.23) 



Treating d/dx, djdy , and djdz as if they were the rectangular components of 
a vector, when we calculate the scalar product of V and A, we get exactly 
the same expression as (2.22). For this reason, we indicate the divergence of 
A by V*A. 

Let us consider a special case in which A is a vector in a two dimensional 
space, i.e., A is parallel to the xy plane and is a function of x and y only 
(independent of z). Let V be a volume S 0 x 1 where S 0 is the base area, and 
the height is unity as shown in Fig. 2.12. The integration of V-A over V is 
given by 

f V-A dV = f V-A dS x 1 

J v J s 0 



'K v .t 



Fig. 2.12. Explanation for two-dimensional Gauss’s theorem. 
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By Gauss’s theorem, this is equal to the surface integral of A*n over the 
closed surface S of V. However, since by hypothesis, A is parallel to the xy 
plane, the contributions from the top and bottom surfaces vanish, and hence 
the surface integral over S is equal to the contribution from the cylindrical 
surface whose length is L : 

I A*n<iS=r A*nd/xl 

J s J L 

From the above two equations and Gauss’s theorem, we obtain 

f V-A dS= f A-n dl (2.24) 

J So J L 

This is the two-dimensional Gauss’s theorem. A comparison of (2.21) and 
(2.24) shows that the two dimensional case is obtainable simply by changing 
the volume and surface integrals in the ordinary Gauss’s theorem to the 
surface and line' integrals, respectively. 

The other differentiation, the rotation of vector A, is indicated by V x A 
(del cross A). This is a vector quantity in contrast to the divergence which 
is a scalar quantity. The component of the rotation in the direction of an 
arbitrary unit vector n is given by 

n • V x A = lim — f A • dl (2.25) 

^oAS} ac 

where AS is a small area with n being normal to it, AC is the closed contour 
of AS and dl is the tangential vector of AC with a magnitude equal to the 
length of the small segment of AC, as illustrated in Fig. 2.13. The integration 
of n*V x A over a surface S is equivalent to the summation of a large 
number of quantities n-VxA /IS over the surface. Each can be converted 




Fig. 2.13. Vector relation in the definition of rotation. 
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to the line integral along the closed contour AC. However, the contributions 
from the common borderline of the neighboring areas cancel each other since 
the directions of the integrals are opposite, as shown in Fig. 2.14. The re- 
maining contribution comes from the outermost periphery of S only. 
Therefore, we obtain Stokes’s theorem: 

J V x A • n dS = J A-dl (2.26) 

where C is the closed contour of S. The direction of the contour integral is 
shown in Fig. 2.14. 




Fig. 2.14. Explanation of Stokes’s theorem. 



Let us consider how V x A should be expressed in terms of the rectangular 
components. If n is the unit vector in the ^-direction, then, (2.25) gives the 
x-component of V x A, namely (V x A),. The shape of A S is arbitrary as 
long as it is small and perpendicular to n. However, in order to make the 
calculation simple, let us consider a rectangle with sides parallel to the 
y- and z-axes, as shown in Fig. 2.15. The center of the rectangle is given by 
(x,y, z) and the lengths of the sides are Ay and Az, respectively. Then we 
have 

(V x A)* = lim (Ay AzY 1 | A *dl 

J AC 

= lim (Ay Az )~ 1 {A z (x, y + \ Ay, z) Az - A z (x, y - \ Ay, z) Az 
+ A y (x, y, z — \Az) Ay - A y (x, y, z + \Az) Ay} 

= lim(dy Az)~ 1 {(dAJdy) Ay Az - (dAJdz) Az Ay} 

= (dAJdy) ~ (dAJdz) 



In a similar manner, other components can be obtained. Combining the re- 
sults, we obtain 
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z 




expression; hence, the rotation of A is indicated by V x A. Equation (2.27) 
can also be expressed in determinant form similar to (2.12). 

ijr b 

V x A = djdx d/dy djdz (2.28) 

A x A y A z 

Since V x A is a vector, we can consider the divergence of V x A, V • V x A. 
By definition, 

1 1 

V*VxA = lim — VxA-n dS 

AV-+ 0 AV Jjs 
1 

= lim — A • dl 
av-o AV AC 

where Stokes’s theorem is used, AS is the closed surface of AV and AC is 
the closed contour of AS. However, since AS is a closed surface, the contour 
AC shrinks to a point and its length becomes zero. Thus, the contour integral 
is always equal to zero. This means that 

y- V • V x A = 0 ( 2 - 29 ) 

Since V x A is a vector, we can also consider the rotation of V x A, 
\ x (V x A). However, in order to simplify the explanation, let us first 

% 
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discuss a differentiation of scalar functions, which leads to a vector. A point 
in actual space can be specified by a vector drawn from the origin to the 
point. Therefore, let us identify each point by a vector r. Let rj and r 2 be 
two points close to each other, and let <p be a single valued function whose 
values at r 1 and r 2 are indicated by ip t and <p 2 , respectively. In the limit of 
r t — r 2 = dr -> 0, let us express dtp = (p x — ip 2 in the form of a scalar product 
of dr and some vector \<p, i.e.. 



dip = \(p • dr 



(2.30) 



The vector \(p defined through (2.30) is called the gradient of (p. Suppose dr 
lies on a constant <p surface, then d<p is equal to zero and so is \<p'dr by 
definition. This means that V<p is perpendicular to dr. Since the direction of 
dr is arbitrary as long as it lies on the surface, it follows that V<p is normal 
to the constant (p surface. When (p changes rapidly with distance normal to 
the constant q> surface, from the definition (2.30) the magnitude of \cp is 
large and vice versa. This is the reason why \(p is called the gradient of <p. 

The integral of (2.30) along a closed path C must be equal to zero: 



w. 



\(p • dr = 0 



This is because (p has the same value at both ends of the path which are 
really one and the same point. Since C is arbitrary, the definition of the 
rotation (2.25) shows that V x V<p is always equal to zero, i.e., 

V x = 0 (2.31) 

Since \<p is a vector, we can consider the divergence of \(p, V*V<p which 
is sometimes written in the form \ z (p. 

Let us derive an expression for \(p in terms of its rectangular components. 
In the rectangular coordinate system, we have 

dr = i X dx + i y dy + i, dz 

d(p d(p dip 

d<p = - - dx + —dy + — dz 

ox dy dz 

Substituting these expressions into (2.30), we obtain 

d(p d(p 6(p _ 

— dx + dy + — dz = (i* • \<p) dx + (i • V<p) dy + (i z • \ip) dz 

8x dy dz 
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Since dx, dy, dz are arbitrary infinitesimals, the above relation requires 



= V V< P, 



= ly ' , 



= i z ■ V<p 



This means that the x, y, and z components of V</> are given by d(p/dx, 
d<p/dy, and dtp/dz, respectively. Thus, we have 



dip dip dip 

= i* — + iy — + i z — 

dx dy dz 



(2.32) 



If <p is placed after the operator V defined by (2.23), the same expression can 
be obtained, and it is for this reason that the gradient of <p is indicated by \(p. 

Since the divergence of a vector is obtainable by formally taking the scalar 
product of V and the vector in the system of rectangular coordinates, the 
expression for V*V<p is given by 



d 2 (p d 2 <p d 2 (p 

V.V<p = —3 + --V + -4 
dx 2 dy 2 dz 2 



(2.33) 



which is simply the scalar product of V and (2.32). The divergence of a 
vector A is a scalar function and we can, therefore, consider the gradient. 
This is expressed by V(V-A), where the parentheses indicate that the inside 
operation must be performed first. 

We are now in a position to discuss the rotation of V x A, which was 
previously postponed. Treating V as if it were a vector defined by (2.23), 
the formal application of (2.14) to V x (V x A) gives 

V x (V x A) = (V • A) V — (V • V) A 

Since B in the first term on the right-hand side of (2.14) can be placed in 
front of (A*C) without changing the relation, and since V is a differential 
operator on A which is customarily placed in front of A, let us move V 
to come before (V*A) in the first term on the right-hand side of the above 
equation. This gives 

V x (V x A) = V(V-A)-(V-V)A (2.34) 



The first term on the right-hand side is the gradient of the divergence A. 
The second term is similar to the divergence of a gradient. Since A is a 
vector however, (V*V)A has yet to be defined. If we use (2.34) to define 
(V*V)A and also use V 2 to indicate (V*V), even when a vector follows it, 
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then we have 

(V- V) A = V 2 A = V(V *A) — V x (V x A) (2.35) 

Since each term on the right-hand side is a vector, V 2 A thus defined is a 
vector. 

Let us derive an expression for V 2 A in the rectangular coordinate system. 
In this system the divergence, gradient, and rotation are all obtained by the 
formal application of V defined by (2.23) and since the definition of V 2 A 
is obtained by the formal application of vector multiplication formula (2.14) 
to V x (V x A), if we place A after the scalar multiplication of two V’s as 
defined by (2.35) the expression for V 2 A is obtained: 



V 2 A = 




d 2 A d 2 A d 2 A 
dx 2 + dy 2 dz 2 




(2.36) 



In the rectangular coordinate system, \ 2 (p and V 2 A have exactly the same 
form, as we can see from (2.33) and (2.36). This is not necessarily true, 
however, in other coordinate systems. When V 2 A is defined through (2.35), 
it generally has a different differential form from that for \ 2 (p in the same 
coordinate system. 

Let us next consider the differentiation of the product of functions. In 
ordinary differentiation, the derivative of a product of functions is given by 
the sum of terms in which only one of the functions is differentiated with the 
others remaining unchanged. For example, the derivative of fg is given by 
f 'g + fQ' where the prime indicates the derivative. Since V can be expressed 
in the form of (2.23), a similar operation is possible. Indicating the differential 
operators on A and on B by \ A and V B , respectively, we have 



V • (AV • B) = V^ • (A V • B) + V B • (AV ♦ B) = (V • A) (VB)+AVVB 
Similarly, we have 



(2.37) 



V • ( A x V xB) = V x A * V xB-A'V x V xB (2.38) 



The negative sign in front of the second term on the right-hand side is the 
result of interchanging the order of V and A during the derivation. The 
rectangular coordinate system was used to derive (2.37) and (2.38). Once 
both sides of an equation become vector expressions, however, the equation 
is valid, independent of the coordinate system utilized. If we take the volume 
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integrals of (2.37) and (2.38), by Gauss’s theorem, the left-hand sides become 
the surface integrals. After the proper transposition of terms, we have 

| A* VV -B dV = — J (V *A) (V *B) c/F + J (n-A)(V.B ) dS 

(2.39) 

| AV x V x BdF = J V x AV x B ciV - J A x V x B-n^S 



A x V xB-ndS 



(2.40) 



These are formulas of integration by parts. When A and B are two- 
dimensional vectors, one can obtain the corresponding formulas by changing 
volume integrals to surface integrals and surface integrals to line integrals 
just as we did in the two-dimensional Gauss’s theorem. 

f AVVB = - f (V-A)(V-B )dS+ f (n.A)(V-B)d/ (2.41) 

J S J S J L 

[ A-VxVxB dS= ( VxA-VxBdS - [ AxVxB-n dl (2.42) 

The differentiation of a function of a function can also be discussed in a 
similar manner to the ordinary differentiation of a function of a function. 
As an example, let us consider Vr where r is a scalar given by(r-r) 1/2 . Since 
Vr 2 is in the form of the differentiation of a function of a function, it is given 
by 2r Vr. On the other hand, from the differentiation of a product of func- 
tions, we have 

Vr 2 = V(r • r) = 2 (r ♦ V) r 

= 2{x ( djdx ) -(- y (djdy) + z {djdz)} (i^x + i y y + i Z z) = 2 r 



Combining these two results, we obtain 

Vr = (r/r) 

The same result is obtainable through the following manipulation. 

Vr = V(r-r) 1/2 = i(r.r) _1/2 2r-Vr = (r/r) 

where the relation r*Vr = r calculated for (2.43) is used. 

Let us apply (2.43) to the calculation of V 2 (l/r). Since 



(2.43) 



V (1/r) = — Vr/r 2 = — r/r 3 



(2.44) 
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V - r = {i x (d/dx) + i y (d/dy) + i z (djdz)} *(i x x + i y y + i Z z) = 3 

Our next task is to derive Helmholtz’s theorem which states: A differenti- 
able but otherwise arbitrary vector function can always be expressed as the 
sum of the gradient of a scalar function and the rotation of a vector function. 
Before proceeding, let us first prove the following lemma: One of the 
solutions of Poisson’s equation 

\ 2 <p = -q (2.46) 

is given by 

(2 - 47) 

where R is defined by 

K = {(r-r'Mr-r')} I/2 (2.48) 



We shall indicate the differentiations with respect to r and r' by V and V", 
respectively. Let us apply V to (2.47) and then integrate the result over a 
closed surface S. Interchanging the order of integrations on the right-hand 
side, we have 

V<p -n dS = (T — V - n dS dv (2.49) 

Js JJs 4ti R 

If r' is located outside the S, the surface integral on the right-hand side can 
be converted to a volume integral, i.e.. 



f q(r') 1 f q(r') 1 

— V-«n dS=\ — \'V-dv 
J s 4ft ^ J v 4ft R 



where V is the volume enclosed by S. Since V*V(I /R) is equivalent to 
V*V(1 /r) with origin at r\ from (2.45) V*V(1 /R) is equal to zero provided 
that R does not vanish. From this it follows that, when r' is outside S, the 
left-hand side of (2.50) is equal to zero. If r' should be inside S, R becomes 
zero at r = r' and the integrand on the right-hand side of (2.49) loses the 
meaning at this point. To avoid this difficulty, let us consider a small 
sphere centered at r' with volume v 0 and surface S 0 . Using Gauss’s theorem. 
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we have 



q(r) 1 f g(r') 1 f q(r') 1 

^--'V~.n dS=\ 'V-V-rfF- (2.51) 

Js 4ft R jv-vo 4ft R Js 0 4ft R 

where n is the outer normal unit vector from V — v Q . Note that n is directed 
toward the center of the sphere on S 0 . The first term on the right-hand side 
of (2.51) is equal to zero since V‘V(l/i?) vanishes everywhere in the three- 
dimensional region defined by (V — v 0 ). The second term can be calcualted 
as follows. Since 

n = - (r - r')/R 

on S 0 , and since (2.44) clearly shows 



V(l/i?) = -( r - r ')//? 3 

we have 

( ®JL) v L. nds= [ jM iS . 

Js 0 4ft R J s 0 4ft/? 

The integrand on the right-hand side is a constant on S 0 and the surface 
area of S 0 is 4fti? 2 , hence the value of the integral is equal to </(r'). From 
this and (2.51), it follows that when r' is inside S , 



r 9(0 i 

I V -*n 

Js 4ft R 



dS = -q (r') 



If r' is outside S , the surface integral is equal to zero as we showed before. 
Therefore, the contribution to the volume integral on the right-hand side of 
(2.49) comes from the elementary volume dv' inside S, i.e., from the volume 
V only. That is, 

j* V<p.ndS = -J q{t')dv' 

By Gauss’s theorem, the left-hand side becomes the volume integral of 
V*V</>. The primes in the left-hand side can be omitted without changing the 
result. Therefore, we obtain 



J {V • Vq> + g(r)} dv = 0 



Since V is arbitrary, it follows from this that (2.47) satisfies (2.46). This 
completes the proof of the lemma. 
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In order to get Helmholtz’s theorem, we next apply the above result to 
the following vector version of Poisson’s equation 

V 2 W = - F (2.52) 

Since in the rectangular coordinate system, (2.52) is satisfied if each compo- 
nent satisfies Poisson’s equation (2.46), 



J 4nR 



must be a solution of (2.52). On the other hand, from (2.35) and (2.52,) we 
have 

F = — V 2 W = — V (V * W) + V x (V x W) (2.54) 

To obtain an explicit form of F, let us calculate the divergence and rotation 
of (2.53) and substitute into the above equation. — V* W is given by 

C F I C F 1 

— V* W = — \ — .\-dv' =]-•?'- dv' (2.55) 

J 4k R J 4tc R 



Using the relation 



F V' • F 1 

V' — = + F* V - 

R R R 



we can rewrite the right-hand side of (2.55) and then apply Gauss’s theorem. 
The result is 



F-n , f V'-F , 

V • W = - — dS'- dv' 

J 4rcR J 4nR 



On the other hand, V x W is given by 



C F 1 f F 1 

V x W = - — x V - dv' = — xV-dv' 

J 471 R J 4n R 



Using the relation 



, F V' x F „ _ 1 

V' x - = — - F x V' - 

R R R 



the above equation becomes 



fV'xF f F 

V x W = dv' - V' x — dv' (2.57) 

J 4nR J 4nR 



i \ 
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If K is a constant vector, using Gauss’s theorem, we have 



F f F 

K x dv' = — K • V' x dv' 

4nR ‘J 4kR 

(kx- .nd5' = K.f 

J 4? zR J 



x n dS' 



Since K is arbitrary, it follows from this that 



f V' x — do' = f — x n dS' 
J 4nR J 4nR 



Substituting this into the right-hand side of (2.57), we have 



VxW = 



f V' x F , 

= dv' + 

J 4nR J 



F x n 



(2.58) 



The primes on the right-hand sides of (2.56) and (2.58) can be omitted with- 
out changing these results. Substituting (2.56) and (2.58) into (2.54), we 
obtain the desired expression for Helmholtz’s theorem. 



F = V 



n dS - 



+ V x 



F x n 



V x F 



This shows that an arbitrary, but differentiable, vector function F can be 
expressed as the sum of the gradient of the scalar function in the parentheses 
of the first term and the rotation of the vector function in the second term. 
As is easily seen, V*F = 0 (in volume V) alone is not sufficient for F to be 
expressible as the rotation of a vector function. On the other hand, V*F = 0 
(in volume V) and F*n = 0 (on surface S) are sufficient to ensure F to be 
the rotation of a vector. Similarly, V x F = 0 (in volume V) alone is not 
sufficient, but together with F x n = 0 (on surface S), it guarantees that F is 
the gradient of a scalar function. 

Before closing this section, let us derive the expressions for V*A, \<p, 
and V 2 (p in the cylindrical coordinate system, for later use. We apply the 
definition (2.20) for the divergence to the volume element A V given by 
Ar • rAO ■ Az shown in Fig. 2.16. The surface integral is given by 



{A r (r + Ar)-Az-(r + Ar) Ad — A r (r) Az-r Ad } 

+ {A e (d + Ad) Ar- Az — A g (9) Ar- Az) 

+ {A z (z + Az) Ar-r AO — A.(z) Ar-r Ad) 

= (dr AJdr) Ar Az AO + (dA e jdO) Ar Az Ad -I- (dAJdz) Ar AzrAO 
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2.2. Maxwell’s Equations 
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z 




Fig. 2. 1 6. Volume element A V in cylindrical coordinate system. 




where the first, second, and third terms express the contribution from the 
r-constant, 0-constant, and z-constant surfaces, respectively, and use is 
made of the abbreviated forms of functions in which only that coordinate 
of primary concern is explicitly given in the parentheses. Dividing the above 
expression by A V, we obtain 



Id, x 18A g dA, 

V • A = - - (rA r ) + - - - e + — 

r dr y rJ r 80 dz 



(2.60) 



In order to obtain the expression for V<p, let i r , i 0 , and i z be the unit 
vectors in the r, 0, and z directions. The elementary displacement dr is 
given by 

dr = i r dr + i 0 r dd + i z dz 



The increment dtp of a function <p due to the displacement dr is given by 



i,.* *+%*+*>* 

or 06 dz 

( d<p 1 d<p d(p 
i 'fr +i ‘-rS 0 +K 6 z 
= \(p»dr 



(i r dr + i 0 r dd + i z dz) 



•r> 







It follows from this that 



d(p 1 d(p d(p 

\q> = i r — + i* - — + i z — 
dr r 80 dz 



(2.61) 



Substituting this in place of A in (2.60), we obtain 



Id/ 8q>\ 1 d 2 q> d 2 (p 

V.Vc, = -~r- +- T ~4 + -4- 
y rdr\ dr r 2 8Q 2 dz 2 



(2.62) 



which is the expression for V 2 <p in the cylinderical coordinate system. 



2.2 Maxwell’s Equations 

If we take a coil having many turns with a voltmeter connected between 
its terminals and then move a magnet so that its flux threads the coil, the 
pointer of the voltmeter moves, indicating that some voltage is developed 
between the terminals. Since the voltage disappears when the magnet is 
stationary, the terminal voltage seems to be related to the time variation of 
the magnetic flux passing through the coil. By increasing the size of magnet, 
reversing the polartiy, or by varying the speed with which the magnet 
approaches the coil, one is gradually convinced that the terminal voltage is 
proportional to the time derivative of the total flux crossing the coil, d&fdt. 
Keeping the motion of the magnet relative to the coil the same, if the 
number of turns of the coil is increased, the terminal voltage increases 
proportionally. From this, it is reasonable to assume that a voltage appears 
in a coil with only one turn, which is proportional to the time derivative of 
the flux through the turn, i.e., 



V oc dd>jdt 

where cc indicates proportionality. On the right-hand side <P can be expressed 
as the integral of the scalar product of magnetic flux density B, i.e., magnetic 
induction, and the normal unit vector n over a surface S surrounded by the 
contour C which coincides with the coil as shown in Fig. 2.17. Also V is 
equal to the negative of the integral of electric field E from p to p, however, 
the existence of V depends more on the coil surrounding the flux than on 
the gap between p and p . Since E must be small along the conductor, the 
limit of the integral from p to p' can be extended to include the entire 
closed contour C without changing the value appreciably. The above 
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Fig. 2.17. Explanation of electromagnetic induction. 



proportionality relation can then be rewritten in the form 



- f E-dloz ~ f Bn dS 

Jc 



Let use choose the unit of B so that the constant of proportionality becomes 
unity, i.e.. 



| E -dl = -~ ( B-n dS 

Jc ^Js 



(2.63) 



where the direction of C is such that when a right-handed screw is turned in 
that direction, it will advance in the direction of n. 

Next, suppose a conducting wire carrying a heavy current passes vertically 
through the center of a sheet of paper with iron powder scattered on it. 
The iron powder will indicate that a magnetic field surrounds the current. 
A more elaborate experiment tells us that the magnetic field intensity is 
proportional to the magnitude of the current 7 and is inversely proportional 
to the distance from the conductor. Consider the contour integral of magnetic 
field H along a circle C 0 with the center at the conductor and radius r on a 
plane perpendicular to the conductor. Since the length of the contour is 
proportional to r and the intensity of H inversely proportional to r, the value 
of the integral becomes independent of r. However, since H is proportional 
to 7, we have 



f H 

J Co 



dice I 



The current can be expressed as the surface integral of the scalar product of 
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the current density i and the normal unit vector n over a surface S surrounded 
by C 0 . The above result then becomes similar to (2.63), for which the shape 
of C is arbitrary. Therefore, it may be natural to assume that 






(2.64) 



where C is an arbitrary closed contour and S' is a surface surrounded by C. 
The constant of proportionality is made unity by properly selecting the 
unit for H. 

Let us first check the validity of (2.64) for the special case of the vertical 
conducting wire discussed above. If we use the cylindrical coordinate 
system with the z-axis coinciding with the conductor, then since 

Hoci„(7/r) 

and 

dl = i r dr + i 0 r dQ + \ z dz 
the contour integral of H becomes 



/ cH . di =cJ t 



which is equal to 2nl when C encloses the conductor and is equal to zero, 
otherwise. If In is included in the constant of proportionality, this result 
confirms (2.64) for this special case. However, if we assume that (2.64) is 
always correct, we encounter some difficulties, especially when a capacitor is 
inserted in the conducting path as shown in Fig. 2.18. A magnetic field 



CAPACITOR - 
ELECTRODES 



~rr~ %>,' 

it ^ 



FIXED 

CONTOUR 



Fig. 2.18. Explanation of displacement current. 
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surrounds the conductor as long as current is flowing through it, and if S' 
is chosen so as to cut the vertical conducting wire in Fig. 2.18, there is no 
problem. However, if S passes between the electrodes of the capacitor, as 
shown in Fig. 2.18, the current density i is zero everywhere on S, and 
hence (2.64) leads to a contradiction that the line integral of H along the 
same contour C becomes zero or finite, depending on how S is chosen. This 
necessitates some modification of (2.64). When current is flowing into a 
capacitor, it is gradually charged and the electric field between its electrodes 
increases with time. Therefore, dD/dt exists in place of i, where D = eE 
is the electric displacement. Let us call dD/dt the displacement current 
(although nothing is displaced in vacuum) and add this to the current in the 
right-hand side of (2.64) to take care of the above difficulty: 



I H -dl = (i + 

Jc Js V 



where the magnitude of £ is so chosen that the constant of proportionality 
in front of dD/dt becomes unity. 

We have now obtained (2.63) and (2.65) as the most plausible equations 
expressing electromagnetic field relations. Although the left-hand sides of the 
two equations are contour integrals whereas the right-hand sides are surface 
integrals, we can convert the contour integrals to surface integrals by 
Stokes’ theorem. After transpositions, we have 



I( v - 



dB\ 

EH ) *n <2S = 0 

dt 



V x H — I i + — J > • n dS = 0 



(2.67) 



Since S is arbitrary, the integrands have to be zero : 



V x E = 



( 2 . 68 ) 



dD 

V x H = i + — 
dt 



(2.69) 



Since there is always electric charge g at the terminal point of D which is 
considered as the source of D, from the definition of divergence (2.20), we 
have 

VD = p (2.70) 
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where the unit of charge is again properly chosen so that the constant of 
proportionality is unity. On the other hand, positive and negative magnetic 
charges are always found in pairs, and in any finite volume there is no net 
magnetic charge. This experimental fact is expressed in the form 

V*B = 0 (2.71) 

The four equations (2.68), (2.69), (2.70), and (2.71) are called Maxwell’s 
equations. 

Taking the divergence of (2.69) and noting that the order of differentia- 
tions with respect to time and position can be interchanged, we obtain 

a 

V»i + — VD = 0 

dt 

since the divergence of rotation is always equal to zero. The substitution of 
(2.70) into the equation gives 

. dg 

V *i = — (2.72) 

dt 7 

This expresses the conservation of charge since by Gauss’s theorem it means 
that the decrease of g in a certain volume V is due to the outflow of g in the 
form of current through the surface S enclosing V. Similarly, taking the 
divergence of (2.68), it can be shown that V • B is a constant independent of 
time. This constant is chosen to be zero in (2.71) to conform with experi- 
ments. 

As we have already discussed, the choice of the units for electromagnetic 
quantities is made in such a way that the constants of proportionality 
become unity in Maxwell’s equations. The unit of each quantity appearing 
in the equations is listed below: 



E: 


[volt]/ [meter] 


[V/m] 


H: 


[ampere-turn]/ [meter] 


[AT/m] 


B: 


[weber]/ [meter] 2 


[Wb/m 2 ] 


D: 


[coulomb]/[meter] 2 


[C/m 2 ] 


i: 


[ampere]/ [meter] 2 


[A/m 2 ] 


g: 


[coulomb]/ [meter] 3 


[C/m 3 ] 



Maxwell derived his equations through more or less the same argument 
described above after being stimulated by the experimental results of Faraday 
and others. Mathematically, the equations do not contradict each other, but 
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this does not mean that they are proved. In fact, there is no proof that 
electromagnetic phenomena should obey Maxwell’s equations which must, 
therefore, be considered as bold postulates. The only support for the validity 
of Maxwell’s equations is the fact that no macroscopic experiment has been 
conducted successfully which disproves them. Although there is no proof, 
let us accept their validity and consider how various phenomena can be 
explained or what kind of phenomena should be expected as natural con- 
sequences of Maxwell’s equations. For instance, one argument goes as 
follows: Equation (2.69) gives (2.67) which in turn becomes (2.65), and it 
follows from this and a symmetry argument that the intensity of H around 
a straight conductor carrying a current / is proportional to I and inversely 
proportional to the distance from the conductor. Although this explanation 
of Maxwell’s equations might sound strange to some of us, it should be 
remembered that physics is generally based upon similar foundations. Let us 
consider Newtonian mechanics. From experiments, it was inferred that force 
f was proportional to mass m times acceleration, i.e., 

f = m ^ (2.73) 

dt 

where r indicates the position. By assuming that this equation always holds, 
a number of useful theorems were deducted from it. There was no proof for 
(2.73), in fact, it was later discovered that when the velocity of an object 
approached light velocity, (2.73) was no longer adequate and this led to the 
theory of relativity. In spite of these limitations, (2.73) is a very valuable 
result and worth studying together with the theorems deduced from it. 
Similarly, some macroscopic phenomena might be discovered in the future 
which disprove the universal validity of Maxwell’s equations. Then, the 
equations will have to be revised accordingly. Nevertheless, the present 
form of Maxwell’s equations will remain worth studying which is one of the 
objectives of this book. 

In ordinary media, D and B are approximately proportional to E and H, 
respectively. The constants of proportionality are generally indicated by 
e and /x: 

D = eE (2.74) 

B = fiH (2.75) 

Since the units of D and B have been chosen so as to make the constants of 
proportionality in Maxwell’s equations unity, e and n for a vacuum are not 
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unity but have the following values: 

e 0 = 8.854 x 10“ 12 [farad/meter] 

= 1.257 x 10" 6 [henry/meter] 

where the subscript 0 is used to indicate the values for vacuum. In other 
media, e and fj. are different from £ 0 and n 0 . Therefore, we write as 




e = £ r £ 0 
l 1 = M o 

and call e r the relative dielectric constant and jti r the relative permeability 
of the material. The current density is also approximately proportional to 
E in ordinary materials, i.e., 

i = oE (2.76) 

The constant of proportionality a is called the conductivity of the material. 

Equations (2.74)-(2.76) are approximations. However, if we consider an 
idealized model for which these relations hold, then (2.68) and (2.69) 
become 



dH 

V x E = — u -- - 
dt 


(2.77) 


dE 

V x H = <rE + £ — 
dt 


(2.78) 




These equations are linear and the coefficients are real, therefore the same 
arguments we used in the discussion of transmission line theory can be 
applied here, and the same time factor e jV)l can be used for the electromagnetic 
field. The differentiation with respect to time becomes equivalent to the 
multiplication by jco. Thus, (2.77) and (2.78) become 

V x E = - ja)fiU (2.79) 

V x H = cE + j(oe E (2.80) 




These are Maxwell’s equations in the present case. Equation (2.71) can be 
derived from (2.79), and (2.70) from the conservation of charge and (2.80). 

Let us turn our attention to energy. Substituting (2.77) and (2.78) into 
the identity 

V E x H = H V x E-E V x H 
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and integrating the result over a volume V, we obtain 

J ExH*ndS = — j* a E 2 dv-~j i(eE 2 + fiH 2 ) dv (2.81) 

where the left-hand side is converted into the surface integral by Gauss’s 
theorem. The surface enclosing V is S, and E 2 is an abbreviation for E*E. 
The vector E x H appearing in the left integral is called Poynting vector. 

Let us assume that there are electrodes at the top and bottom surfaces 
of a small cylindrical volume A V whose axis is parallel to E, as shown in 
Fig. 2.19, and consider the power consumption in AV. The voltage between 




Fig. 2.19. Explanation of power consumption in conductive medium. 

the electrodes is given by |E| Al and the current by a|E| AS. The ohmic loss 
is given by voltage times current which is equal to ctE 2 AS Al = aE 2 AV. 
Eliminating the electrodes, the same amount of power is considered being 
consumed since the electric field in A V stays the same. Therefore, j K <rE 2 dv 
should give the total power consumption in V. Since this term is related to 
energy, the other terms in (2.81) must also be related to energy. Although 
there is no particular reason for the following approach, for convenience, let 
us argue as follows : The integral of the Poynting vector over 5j s ExH*n dS, 
represents the power flowing out through S, j K |eE 2 dv is the electric energy 
stored in V and J K %/j.H 2 dv is the magnetic energy stored in V. Then, (2.81) 
expresses the conservation of energy since the sum of the power consumed 
in V and the outflow of power through S becomes equal to the rate of de- 
crease of the total stored energy. 

Let us next consider the case in which the electric and magnetic fields 
have the time factor e jtot , Maxwell’s equations are then given by (2.79) and 
(2.80). The average power consumption in V can be calculated by multi- 
plying E by yjl e Jbit and taking the real part for the interpretation of E in the 
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2 ? 



equations. Therefore, for oE 2 , we have 

o-jRe^S Ee Jo> ‘)} 2 = 2a (cos tot (Re E) - sinafi(ImE)} 2 

= 2a {cos 2 cot (Re E) 2 - 2 cos cot sin cot (Re E) • (Im E) + sin 2 cot (Im E) 2 } 

We integrate this over one period T and divide the result by T to take the 
time-average. The average of cos 2 cot over a period is seen to be one-half from 
Fig. 2.20. Similarly, the average value of sin 2 ot is given by while the 
average of cos cot sin cot is zero. Therefore, the average power consumption 
is given by 

J T~ l J T a [Re(^/2 Ee jo>t )~\ 2 dt dv = j ^a {(ReE) 2 + (ImE) 2 } dv 

= J aE-E*dv 





Fig. 2.20. Graphs of cos 2 cot and cos cot sinw/. 
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A similar discussion shows that the average electric and magnetic energies 
stored in V over a period are given by \ v ^eE*E* dv and J K dv> 

respectively. 

Now, subtracting H**(2.79) from E*{the complex conjugate of (2.80)} 
and integrating the result over V, we have 



J (E • V x H* — H* * V x E )dv 

= J <rE-E* dv + jco j (fiH-H* -eE'E*)dv (2.82) 



Using the identity 



V-E x H* = H**V x E - E V x H" 



the left-hand side of (2.82) can be converted to a surface integral by Gauss’s 
theorem. The result is 



| (ExH*).(-n) dS = j oE-E* dv +jo>j (/iH . H* — eE ■ E*) dv 

E x H* in the left integral is called the complex Poynting vector. The first 
integral on the right-hand side is the average power consumption while the 
second integral is twice the difference between the average magnetic and 
electric energy stored in V. Since the real and imaginary parts can be equated 
separately, we have 



Re 

Im 



J* E x H* • (— n) dS = (average power consumption in V) 

E x H* • (— n) dS — 2a> (average magnetic stored energy 
J s — average electric stored energy) 



(2.83) 

(2.84) 



The power consumed in V may be considered as coming through S, and 
if we assume that the incoming power per unit area is given by Re (E x H*) 
*(-n) at each point on S, the total power coming through S becomes 
exactly equal to the power consumption in V, as shown in (2.83). For this 
reason, let us consider that the real part of the complex Poynting vector 
represents the power flow density. However, this interpretation of Poynting 
vector is rather arbitrary. We could consider that the power flow per unit 
area was given by (Re(E x H*) + V x X} •(- n) where X was an arbitrary 
single valued vector function of position. By Stokes’s theorem, the integra- 
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tion of V x X*( — n) over a closed surface always vanishes, and X does not 
appear in the final result. 



2.3 Plane Waves 

In this section, we shall consider electromagnetic waves which depend 
upon a single straight-space coordinate, i.e., waves whose amplitude and 
phase are constant over each plane perpendicular to the coordinate. Our 
first problem is to determine whether or not such electromagnetic fields can 
exist. To do so, we assume that E and H are independent of * and y in the 
rectangular coordinate system and then seek solutions of Maxwell’s equation. 
Since E and H are independent of x and y, the derivatives djdx and djdy 
must be zero, and when the field vectors are decomposed into their compo- 
nents, Maxwell’s equations (2.79) and (2.80) take the form 



-(dH y ldz) = (a + jwe) E x 


(2.85) 


( dHJdz ) = (a + jae) E y 


(2.86) 


0 = (cr + jcos) E z 


(2.87) 


~ (dEjdz) = — ja>fiH x 


(2.88) 


(dEjdz) = — j(OfiH y 


(2.89) 


0 = - j(OfiH z 


(2.90) 



From (2.87) and (2.90), it is obvious that E z and H z become zero. Next, 
eliminating H y from (2.85) and (2.89), we have 

d 2 E x , 2 

+ (o) eii- jojuo) E x = 0 (2.91) 

where the symbol for ordinary differentiation is used instead of the one for 
partial differentiation since E x is a function of z only. If E x is obtained from 
this equation under appropriate boundary conditions, H y is then determined 
from (2.89), E y and H x however, remain undetermined. If we eliminate H x 
from (2.86) and (2.88), we obtain an equation for E y identical to (2.91). 
This equation and appropriate boundary conditions determine E y and H x 
is then obtained from (2.88). Thus, E x and H y constitute one pair of related 
vectors, and E y and H x another. These pairs are generally independent of 
each other, except for possible interaction at the boundary. Since they are 
similar, let us first concentrate on the first pair E x and H y . Equation (2.91) 
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is an ordinary differential equation with constant coefficients. Therefore, if 
we assume that E x = Ae~ yz , just as we did in (1.7), and substitute it into 

(2.91) , we have 

y 2 = — (co 2 efi — jcoftff) (2.92) 

For the two solutions of (2.92), let us use the expression 

y = ±(oc+jp) (2.93) 

where a and /? are real, and assume that /? is positive. Since y 2 , given by 

(2.92) , is in the second quadrant of the complex plane, the y’s are in the 




Fig. 2.2J. Relation between y 2 and y. 

first and third quadrants, as shown in Fig. 2.21. From this we see that a is 
also positive since is assumed to be positive. From (2.92), we can write 

a + jfi =j(co 2 en — jcona) 112 (2.94) 

where the real part of the square root is taken as positive. To eliminate 
unnecessary confusion in future, let us now agree that the real part of ( ) 1/2 
is always positive. When the other root is required we shall write it as 
— () 1/2 . Note that the sign in (1.8) complies with this rule. 

From the above discussion, the most general expression for E x is given by 

E x = Ae~^ +JP)2 + Be (<t+JIS)z (2.95) 
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Substituting (2.95) 


into (2.89), we obtain 




where 


H y = Z {Ae~ (a+jP)z - Be {a+Jp)z } 


(2.96) 




z o = 0/{« - ■/(*/*>)}] 1/2 


(2.97) 



Substituting (2.95) and (2.96) into (2.85), we see that (2.85) is also satisfied. 
Thus, we conclude that an electromagnetic field does exist which is indepen- 
dent of the x- and ^-coordinates and satisfies Maxwell’s equations. 

Since it is the ratio of electric to magnetic fields Z 0 has the dimension of 
impedance; electric field times distance has the dimension of voltage, and 
magnetic field times distance has that of current. The quantity Z 0 is deter- 
mined by the property of the medium only and is referred to as its characteristic 
impedance with a positive real part in accordance with the agreement on the 
sign of square root. Note that we used Z 0 in Section 1.1 with a different 
meaning. From a figure similar to Fig. 2.21, it can be shown that the imagi- 
nary part of Z 0 is also positive, in other words, Z 0 is inductive. 

The constants A and B in (2.95) and (2.96) are determined by boundary 
conditions. First, however, let us assume B = 0 and concentrate on the terms 
with A only. The term e~ iPz represents a wave traveling in the positive z 
direction, and e~ zz represents the exponential decay with distance. Since 
Z 0 ” 1 has a negative imaginary part, let us write it in the form J Z 0 1 “ 1 e~ Jv . 
To interpret the magnetic field expression, we multiply it by yj 2 e jv>t and 
take the real part of the result: 

ReySiZof 1 e-» Ae-i m+jn * e iat } = y/2\Z 0 \~ l e~ az A cos (cot - pz - q>) 

Thus, the phase of magnetic field lags by <p behind that of electric field as 
depicted in Fig. 2.22. Similarly, the terms with B are found to represent an 




Fig. 2.22. Electromagnetic field of a plane wave. 
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electromagnetic wave traveling in the negative z direction with decreasing 
amplitude as it propagates. Since at each instant, the electric and magnetic 
fields are both constant over each plane perpendicular to the z axis, the 
above electromagnetic waves are called plane waves. 

When a is equal to zero, no attenuation takes place. Furthermore, the 
phase lag of the magnetic field disappears. In this case, the propagation 
constant and the characteristic impedance are given by 

a +jft = 0 +jco(Efi) 112 (2.98) 

Zo = W<0' /2 (2.99) 

respectively. The corresponding wavelength is 

X = 27 ijfi = 2 tc/co (e/j)' 2 

On the other hand, when a is large, e can be neglected compared to <x/co. 
Thus, we have 

a +jP ~ (jtofur) 112 = (1 + j) (a>/nr/2) 1/2 (2.100) 

Zo * «( -j<rl<o)}' n = (1 + j) (o>/i/2 a) 1 ' 1 (2.101) 

This means that the phase of magnetic field lags 45° behind that of electric 
field and since the magnitude of Z 0 is small, the magnetic field predominates 
over the electric field, compared to the free space case. The amplitudes of 
both electric and magnetic fields decrease by a factor of 1/e when they travel 
a distance 1/a = (2/a>/w) 1/2 called the skin depth of the conductor. 

Let us consider the transmission power per unit area. Since the electric and 
magnetic fields are orthogonal to each other and both are perpendicular to 
the direction of propagation, the Poynting vector is in the z direction, and 
its value is given by E X H*. Therefore, the transmission power is Ke{E x H*} 
per unit area. 

Let a and b be defined by 

a =i|ReZ 0 r 1,2 (E* + Z 0 H„), i. = i|ReZ 0 r 1/2 (£,-Z 0 *H,) (2.102) 

Then, a calculation similar to the one leading to (1.53) gives 

|a| 2 -|6| 2 = Re {£,?/,*} (2.103) 

This means that \a\ 2 — \b \ 2 is equal to the transmission power per unit area. 
Substituting (2.95) and (2.96) into (2.102), we have 




a = |ReZ 0 | -1/2 Ae~ ia+mz 



(2.104) 
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b = IReZol' 1 ' 2 (2.105) 

(. Z 0 Z 0 J 

When a = 0, Z 0 becomes real, and the first term in the parentheses on the 
right-hand side of (2.105) disappears. In this case, a and b can be interpreted 
as the waves traveling in the positive z and negative z directions, respectively, 
and the square of the magnitude of each wave gives the power transmitted 
in the direction of propagation. On the other hand, when a ^ 0, b can no 
longer be interpreted as a wave traveling in one direction since it consists 
of two terms, one traveling in one direction, and the other in the opposite 
direction. Nevertheless, a and b can be considered as waves associated with 
the incident and reflected powers, respectively. Even when the fields travel in 
one direction, e.g., when 5 — 0, there is, in general, a power reflection |6| 2 . 
Therefore, when Z 0 is complex and A is fixed, it is necessary to introduce some 
field reflection, i.e., a finite 5, if the maximum power is to be transmitted. 

On the other hand, in contrast to the definition in (2.102) where a and b 
represented power waves, another pair of waves a{z) and b(z) can be defined 
by 

a(z) = 1,2 (E x + Z 0 H y ), b(z ) = \Z~ Q l/1 (E x - Z 0 H y ) (2.106) 

corresponding to (1.16). Then, we have 

a{z) = Zo 1/2 Ae~ (a+Jl,)z , b(z ) = Z 0 " 1/2 Be (a+jfi)z (2.107) 

which indicate that a{z ) and b(z) are the traveling waves in the positive and 
negative z directions, respectively. Now however, the power expression 
corresponding to (2.103) can no longer be obtained (see Problem 2.8). In 
other words, the traveling waves a(z) and b(z) are not associated directly 
with the transmission power when Z 0 is complex. 

As we mentioned before, there is another pair of vectors E y and H x in 
addition to the one we have discussed so far. For this pair, repeating the 
same procedure, we obtain 

E y = Ce~ (a+mz + De (a+mz (2.108) 

H x = - Zo 1 {Ce~ (a+jP)z - De { * +mz ] (2.109) 

corresponding to (2.95) and (2.96), respectively. Because of the negative sign 
in front of the right-hand side of (2.109), when D = 0, the Poynting vector 
E x H* points in the positive z-direction indicating that power is being 
transmitted in the direction of propagation. The relation between E y and 
— H x is exactly the same as the relation between E x and H r hence these two 
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electromagnetic waves have identical properties except for their field 
directions. 

A more general expression for a plane wave is obtained by the super- 
position of i X E X and i y E y for the electric field and that of i X H X and i y H y for 
the magnetic field. The transmission power due to this superposition is 
given by 

Re(E x H*) = Re {(i x E x + i y E y ) x (i X H X * + i y H*)} 

= Re (i z E x H y *) + Re (i z E y • - H*) (2. 1 10) 

The first and second terms on the right-hand side of (2.110) express the 
power transmitted by the pairs E xy H y , and E y , H x , respectively. Thus the total 
power is the sum of the powers of individual waves. In other words, each 
wave can be considered as carrying its own power independent of the other. 
One might think this is obvious, however, if E x = A Y e~ jpz , H y — Zq 1 A 1 e~ jpz 
represent one electromagnetic wave and E x = A 2 e~ jPz , H y — Zo 1 A 2 e~ jpz 
another, then the superposition of these two waves gives E X = (A Y +A 2 ) 
e~ jPz , H y = Z 0 1 {A v + A 2 ) e~ Jpz , and the transmission power becomes 

Re(E x H*) = Re [i- (A, + A 2 ) {Z^ 1 (A, + A 2 )}*~\ 

= Re{i 2 (ZoT Mrf 2 } + ReftfO* \A 2 \ 2 } 

+ Re{i 2 (Z 0 , )*(^iA 2 * + '4 2 A i *)} 

The first term on the right-hand side is the power transmitted by the first 
wave alone and the second term that due to the second wave alone. The 
existence of the third term shows that the transmission power is not given 
by the superposition of the individual powers. Thus, the principle of super- 
position does not generally hold for power. The reason why the total power 
is given by the direct sum of the powers of two waves in (2.110) is that the 
electric fields and hence the magnetic fields of these two waves are orthogonal 
in space. Generalizing this concept of orthogonality, whenever there are 
two waves whose powers add when their fields are superposed, they are said 
to be orthogonal to each other. Examples will be given in Chapter 3. 

Let us now turn our attention to the superposed electric field itself. From 
(2.95) and (2.108), we have 

E - ( i x A + i y C) e ~ {a+jp)z + (i X B 4- i y D ) e (a+jP)z (2.111) 

Since the two terms on the right-hand side represent waves propagating in 
the positive and negative z-directions, respectively, let us concentrate on the 
first term and investigate how it changes with time. To do so, we take the real 
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part of yj2 e ji0t 



times the first term, i.e., 



Re {y/2(\ x A + i,C) e j(0! ~ (a+jtl}z 

= ^2 {cos {(at - fiz) (i. Re A + \ y Re C) 

- sin ( (at - pz) (i x \mA + i y Im C)} 

= yj2 e~ az {i x \A\ cos (cot - pz + (p A ) + i y \C\ cos (cot - pz + <p c )} 

(2.112) 



where 



lan (p A = (im A /Re A), tan^ c = (Im C/ReC) 



The factor e shows that the amplitude decreases exponentially with dis- 
tance. The expression inside of the parentheses on the right-hand side is the 
sum of two simple harmonic motions, one oscillating in the x-direction with 
an amplitude \A\ and the other in the ^-direction with an amplitude |C|. 
For a moment, let us fix z and consider the locus of the tip of vector E as 
shown in Fig. 2.23 where it is clear that the locus generally becomes an 
ellipse. Since the locus of the tip of the corresponding magnetic vector H 
also becomes an ellipse, the wave is said to be elliptically polarized. When 
<Pa = <P c or M| =0 or |C| = 0, the ellipse degenerates to a straight line and 
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the corresponding wave is said to be linearly polarized. When j A\ = |C| 
and (p A = (p c + 90°, the ellipse becomes a circle and we have a circularly 
polarized wave. It may also be clear from Fig. 2.23 that the rotational 
direction becomes either clockwise or counterclockwise depending on 
whether <p A < < p c or <p c < <p A . If we move the observation point z in the 
direction of propagation, the size of the ellipse becomes smaller and the 
direction of the field vector at the same instant changes. 

The wave propagating in the negative z-direction has identical properties, 
e.g., depending on the amplitude and phase relations between B and D, the 
locus of the tip of vector E becomes an ellipse, a straight line, or a circle. 
The superposition of these two waves propagating in the positive and 
negative z-di recti ons produces a locus which can be an ellipse, a straight 
line, or a circle at a fixed distance z. However, if the observation point is 
moved, the locus changes not only its size but also its shape resulting in a 
rather complicated polarization which depends on z. 

So far in this section, we have assumed E and H to be independent of x and 
y, and we assumed terms of the type d/dx and d/dy to be zero. Let us next 
study the same waves from a slightly different viewpoint using vector expres- 
sion with a, e, and n assumed constant and independent of the position in 
space, as before. 

Taking V x (2.79) and substituting (2.B0) into this result, we eliminate H: 

V x V x E = — jojpi V x H = — j(on(a + joe) E 

The left-hand side is equal to V(V-E) — V 2 E and since V-E is equal to 
zero, as can be shown by taking the divergence of (2.80), the above equation 
can be rewritten in the form 

V 2 E + (o 2 en - jofia) E = 0 (2. 1 13) 

Comparing (2.113) with (2.91), we see that d 2 jdz 2 and E x in (2.91) have been 
replaced by V 2 and E, respectively, however, they both belong to the same 
type of differential equation. Since the solution of (2.91) was obtained by 
assuming the functional form of Ae~ yz , in the present case 
E== Aexp(-h-r) = Ae (- ' , * Jt ' t ' h>,) ’ + ' ,l2) should give a solution where h is a 
constant vector to be determined. Using the relation 

V 2 E = A {(d 2 /dx 2 ) + (d 2 ldy 2 ) + (d 2 /dz 2 )} 

= A (h 2 + h 2 + h 2 ) e -V'** +h yy +h **-> 

= A(h*h)exp(- h-r) 
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(2.1 13) becomes 

A (h • h + ( or eft — jcofuj)} exp ( — h • r) = 0 

Therefore, 

h ♦ h = — (o 2 en — jcafiff) (2.114) 

is a necessary and sufficient conditon for E = A exp(- h-r) to be a solution 
of (2.1 13). It is also necessary to check whether or not this satisfies Maxwell’s 
equation which is done by substituting this expression into (2.79) and (2.80). 
From (2.79), we have 

V x E = V x A exp(- h-r) = V exp(- h-r) x A = - jwfiH 

Since 

V exp ( — h • r) = {i x (dldx) + i ,(dldy) + i 2 (d/dz)} e ~ ihxX+h >' y+h ‘ z) 

= - 0A + »A + *A) exp ( - h • r) 

= — h exp ( - h-r) 

H is given by 

H = (jony 1 h x A exp(- h-r) = ( j(ofx)~ 1 h x E (2.115) 
Substituting this into (2.80), we obtain 

V x H = V x (jofiy 1 h x Aexp(- h-r) = Vexp(-h-r) x (jco^y 1 ^ x A) 
= - (jcofiy 1 h x h x A exp(- h-r) 

= (<? + jcoe) E = (<r + jcoe) A exp ( - h • r) 

This requires A to satisfy 

h x h x A = ( (o 2 eft — jofic r) A 

The left-hand side is equal to h(h-A) -(h-h) A, and since h-h is given by 

(2.114) , the above requirement reduces to 



h • A = 0 



(2.116) 



It follows from the above discussion that E = Aexp(-h-r) satisfies Max- 
well’s equations provided that (2.114) and (2.116) are satisfied. 

Both A and h can be complex vectors. For example, the directions of the 
real and imaginary parts of the vector h may not be the same, and, therefore, 
the direction of maximum attenuation may be different from the direction of 
maximum phase variation. For simplicity, however, let us confine ourselves 
to the case in which these two directions coincide and let n be a unit vector 




h = n(a +jP) 



(2.117) 
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where a and fi are real. Since exp( — h-r) = exp{ — (a +jfi) n»r}, both E 
and H become constant over the surface where n*r is constant. As is illus- 
trated in Fig. 2.24, a constant n*r surface is a plane and, hence, the electro- 
magnetic wave under consideration is a plane wave. Substituting (2.117) 
into (2.1 14), we have 

(a +jfi) 2 = - (to 2 s/i -joiiia) 



= CONSTANT 

Fig. 2.24. Plane satisfying n*r = constant. 




This indicates that a and /? are exactly the same as those obtained from 
(2.94). In addition, by comparing (2.116) and (2.117), it can be seen that 

n • A = 0 (2.118) 

since (a + jfi) is constant and not equal to zero, and consequently, A is 
normal to the direction of propagation. 

From (2.1 15), H is given by 

H = Zo‘n x E (2.119) 

where Z 0 is the characteristic impedance defined by (2.97); and H is normal 
to both n and E. We, therefore, see that the electric and magnetic fields are 
perpendicular to each other and at the same time both are normal to the 
direction of propagation. This corresponds to the result that E x and H y i 

made one pair of vectors, and E y and H x another, while E z and H z were zero 
in the previous discussion. I 

As long as A and h satisfy (2.114) and (2.116), E = A exp(-h-r) gives a 
solution of Maxwell’s equations. A superposition of many solutions of this 
type also contitutes a solution which will not be discussed here because of 
its complexity. | 



t 






Problems 
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PROBLEMS 



2.1 By decomposing vectors into their rectangular components, prove the following 
formulas: 

A X (B X C) — ( A • C) B — (A-B)C 
V-V X A-0 
V X = o 

2.2 Derive the expressions for V • A, V<p, V - Vip in the spherical coordinate system. 

2.3 Prove in the cylindrical coordinate system, that V X A is given by 

V X A = i r + u(*±-'-±) + u ji 0 (rAi) -I*±! 

\r 36 dz J ^02 cr J [r dr r 36 ^ 

2.4 Prove that the integer «, satisfying V 2 r n =• 0, is either 0 or — 1. 

2.5 Prove that 

J ^ (p dv = — V dv + J (pVy/’ndS 

J s<pV 2 >J' dS = - J v-Vy dS + j" <pVyndl 

2.6 Calculate the skin depth of copper at 4 GHz. The conductivity of copper is approx- 
imately 5.8 x 10 7 U/m and the permeability is 1. Also, calculate the characteristic 
impedance of copper at 4 GHz. 

2.7 The tangential components of electric and magnetic fields must be continuous at an 
interface of two media. Assume that a plane wave is incident on an interface from 
the normal direction and calculate the power reflected relative to the incident power 
in terms of the dielectric constants and permeabilities of the media on both sides of 
the interface. 



2.8 Prove that 



(fl(z)| 2 - |b(z)| 2 = |Zo| _1 Re {ZaE x *Hy} 



where a(z) and b(z) are defined by (2.106) and that the right-hand side is not equal 
to Re {E x Hy*} when Zo is complex. 



2.9 Show that Maxwell’s equations (2.79) and (2.80) have no solution other than a 
trivial one when o =£ 0 and the electric and magnetic fields are functions of r only, 
i.e., distance from the origin. In other words, show that an electromagnetic wave 
with spherical symmetry cannot exist. {Hint: Use Gauss’s and Stoke’s theorems.) 
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CHAPTER 3 

WAVEGUIDES 



A cylindrical pipe designed to contain one or more propagating electro- 
magnetic waves is called a waveguide. We shall discuss the theory of wave- 
guides in detail in this chapter. Since waveguides require a treatment quite 
different from conventional circuit theory and radically new to some of us, 
we shall first introduce the particular case of rectangular waveguides. This 
will enable us to become acquainted with new terminology and some 
methods we shall use later. Following this will be an introduction to eigen- 
value problems using an equation derived in the above discussion. Solutions 
of an eigenvalue problem are called the eigenfunctions, and these have wide 
applications in many branches of physics, particularly in acoustics and 
quantum mechanics. Suppose a function representing a linear phenomenon 
is to be determined, we can express it as a linear combination of eigen- 
functions and determine the coefficients using appropriate equations 
governing the process. Once this expression is obtained, we interpret the 
phenomenon as the superposition of simple phenomena, each corresponding 
to an eigenfunction; the method is called the eigenfunction approach. For 
this approach to be useful, it is important to select eigenfunctions whose 
individual behavior is simple and well understood. Therefore, depending 
on the problem, a set of eigenfunctions is chosen to satisfy a certain eigen- 
value problem closely related to the phenomenon under study. Whether or 
not the function to be determined can be expressed as a linear combination 
of eigenfunctions thus selected remains to be investigated. This, of course, 
depends on the eigenvalue problem used for their derivation, but when the 
linear combination is possible for any function of interest, the set of eigen- 
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functions is said to be complete. If the set does not have this property, the 
eigenfunction approach breaks down, even though all the coefficients could 
be determined, since the functions itself cannot be expressed as the linear 
combination in the first place. The completeness of eigenfunctions is dis- 
cussed, therefore, as thoroughly as possible within the reach of elementary 
mathematics. This is followed by discussions on eigenfunctions for wave- 
guides, their general theory, waveguide discontinuities, the effect of lossy 
walls, and waveguides with inhomogeneous media. 



3.1 Perfect Conductors 

Although actual conductors are not perfect, we shall, nevertheless, first 
study a straight waveguide bounded by a perfect conductor. To prepare for 
this discussion, let us consider what a perfect conductor is. A perfect conduc- 
tor is an idealized medium within which the electric field is always equal to 
zero. A good conductor is generally considered as a medium whose conduc- 
tivity <j is large. Therefore, it seems appropriate to define a perfect conductor 
as a medium whose conductivity is infinite. The conditions a->oo and 
E = 0 can be considered equivalent to each other, using the following 
reasoning. The power consumed in a volume V inside a conductor is given 
by jV oE 2 dv. If a increases indefinitely while E remains finite, the power 
consumption must approach infinity. However, since any realistic power 
source can deliver only a finite amount of power, it seems appropriate to 
assume that the condition a -> co corresponds to E-»0 in ordinary cases. 
Furthermore, as we discussed in Section 2.3, the amplitude of a plane wave 
diminishes quickly as it propagates in a conductor, and as a increases, the 
attenuation becomes so large that an electromagnetic field inside a good 
conductor must become negligibly small. In the limit as <r->oo, the field 
must be zero. 

Let us consider an interface between a perfect conductor and a medium 
with finite values of conductivity, dielectric constant and permeability. 
Taking a rectangular path as shown in Fig. 3.1, we integrate both sides of 
(2.79) over the area S of the rectangle. By Stokes's theorem, the left-hand 
side can be converted to a line integral around the closed path a-b-c-d : 

I E • dl = - jeon f H • n s dS 

J abed J 

where n s is the unit vector normal to AS. Letting the width <5 of the rectangle 
diminish, only those contributions from ab and cd remain on the left-hand 
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Fig. 3.1. Integral contour abed. 



side, while the right-hand side becomes negligibly small since H*n s stays 
finite and S' approaches zero. Thus, we have 

f_E -dl+ f_E-d/ = 0 

J ab J cd 

Assuming the length Al of the rectangle is small, this equation can be 
rewritten in the form 

(Ej - E 2 )«n x n 5 Al = - n x (E, - E 2 )»n s Al = 0 

where n is the unit vector normal to the interface, as shown in Fig. 3.1, 
and Ei is the electric field just outside the conductor and E 2 just inside the 
conductor. The orientation of the rectangle, and hence that of n 5 , is arbitrary 
as long as n s lies in the interface. Therefore, from the above equation we 
obtain 

n x (Ej — E 2 ) = 0 

However, since E 2 = 0 by the definition of a perfect conductor, the electric 
field E t just outside the conductor must satisfy 

nxE, =0 (3.1) 

This indicates that the tangential component of an electric field at the 
surface of a perfect conductor is zero. Since n x E = 0 at the surface, the 
the Poynting vector E x H* has no component directed toward the inside 
of a perfect conductor, i.e., n-ExH* = 0. Therefore, all of the incident 
power is reflected back from the surface and none penetrates the perfect 
conductor. 
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Let us consider the magnetic field at the interface. It follows from (2.79) 
that H is zero inside a perfect conductor since E is equal to zero. If H is 
zero, (2.80) shows that not only E but also a E is zero inside a perfect 
conductor. That is, no conduction current flows inside it even though a is 
infinite. However, conduction current can flow at the interface. Let us inte- 
grate (2.80) over the rectangular area shown in Fig. 3.1. By Stokes’s theorem 
we obtain 

- n x (Hj — H 2 )*n s Al = (tE*d s z1S = i*n s AS 

where H t is the magnetic field just outside the conductor and H 2 just inside 
it. Since o is infinite, i AS = oE-AS is assumed to be finite in the limit of 
S -> 0. Let us define the surface current density K through 

K Al = Iimi AS 

< 5->0 

Since H 2 is equal to zero, the above equations show that 

- n x H, = K (3.2) 

The tangential component of E vanishes at the interface ; however, that of H 
usually does not, and the corresponding surface current flows perpendicular 
to it. 

3.2 TE 10 Mode in Rectangular Waveguides 

Let us first consider a straight waveguide with a rectangular cross section 
as shown in Fig. 3.2. We choose this particular model because of its practical 
value and its simplicity in mathematical treatment. The inside medium is 
assumed to be homogeneous and its conductivity equal to zero. Furthermore, 
it is assumed that the electric field has ^-component (E y ) only, i.e., 
E x = E z = 0. Under these conditions, Maxwell’s equations become 



(dHJdy) - (dHJdz) = 0 


(3.3) 


(dHJdz) — ( dHJdx ) = jcoeE y 


(3.4) 


(dHJdx) - (dHJdy) = 0 


(3.5) 


- ( dEJdz ) = — j(QfiH x 


(3.6) 


0 = - jcofiHy 


(3.7) 


(dEJdx) = - j(t)[iH z 


(3.8) 
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From (3.7), we obtain H y = 0. Substitution of this result into (3.3) and (3.5) 
gives dHJdy = dHjdy = 0. Since dHJdy = 0, the magnetic field H does 
not change in the ^-direction. Next, differentiating (3.6) and (3.8) with 
respect to z and x, respectively, and substituting the results into (3.4), we 
obtain the wave equation for £ y , 



d 2 E y d 2 E y 

+ a?" 



+ co 2 EfiE y = 0 



(3.9) 



Thus, the problem is reduced to the solution of this equation under appro- 
priate boundary conditions given by E y = 0 at x = 0 and x = a which 
ensure that the tangential field component at the surface of the perfect 
conductor is equal to zero. Since has no tangential components at 

y = 0 and y = b, the boundary condition there is automatically satisfied. 
Once E y is obtained, H x and H z can be calculated from (3.6) and (3.8), 
respectively. Therefore, all the nonzero components of the electromagnetic 
field can be determined. 

Let us consider particular solutions of (3.9) which vary with z as e~ yz . 
Differentiation with respect to z therefore becomes equivalent to the multi- 
plication by — y. Thus, the wave equation and the boundary conditions 
become 



+ (y 2 + co 2 en ) E y - 0 





(3.10) 

(3.11) 



3.2. TE 10 Mode in Rectangular Waveguides 



89 



Since (3.10) can be considered as an ordinary linear differential equation, 
the solutions must be exponential, sine or cosine functions of x. In order to 
satisfy the boundary conditions at x = 0 and x = a, sin(/77rx/a) must be 
used where n is an integer. Since (3.10) does not specify variation in the 
^-direction, we must take 

E y = A„ sin (nnxfa) f (y) e~ yz 

as a trial function for the solution where f(y) is an arbitrary function of y 
only. Substituting this expression into (3.6) and (3.8) and remembering that 
H has no variation in the y-direction,/(y) is found to be a constant. There- 
fore, we have 

E y = A„sin(nnx/a) e yz (3.12) 

Substituting this into (3.10), we obtain the necessary and sufficient condition 
for (3.12) to be a solution of (3.10), i.e., 

- (nn/a) 2 + y 2 + o) 2 efx = 0 

or, equivalently, 

y = ± {(nnja) 2 - (o 2 efi} il2 (3.13) 

Substituting (3.12) into the original simultaneous equations (3.3) to (3.8), 
we find no contradiction regardless of the value of integer n in (3.13). This 
shows that E y in (3.12), indeed, gives a solution for Maxwell’s equations 
provided that y satisfies (3.13). 

Since the simultaneous equations (3.3) to (3.8) together with the boundary 
conditions (3.11) are linear, any linear combination of the functions on the 
right-hand side of (3.12) with various values of n should also represent a 
possible E y . However, let us concentrate on individual terms. For each n, 

(3.12) gives a possible configuration of the electric field and hence that of 
the electromagnetic field in the waveguide. Such a configuration of the 
electromagnetic field is called a mode. For each mode, as co is increased, y 
always becomes purely imaginary beyond a particular value co c = (spi)~ V2 
(nnja). This indicates that the field as a whole begins to propagate through 
the waveguide above the frequency f c = (oJ2n, where f c is called the cutoff 
frequency of the mode. The free space cutoff wavelength, A c , corresponding 
to f c is given by 2 ajn. Below f c , y is real, showing that the amplitude of the 
electromagnetic field changes exponentially with z. If a mode is excited below 
the cutoff frequency in a certain section of the waveguide, then the amplitude 
cannot increase with distance away from that section. Therefore, the sign of 
y must be selected so that the wave amplitude decreases with distance in 
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both directions. It becomes positive on the positive side and negative on 
the negative side of the source of excitation. 

As the frequency is raised, the first mode to start propagating corresponds 
to n = 1. In practice, if several modes propagate simultaneously, the behavior 
of the waveguide becomes complicated and hard to control. The cross- 
sectional dimensions are generally chosen so that the mode corresponding 
to n = 1 becomes the only propagating mode with the remaining modes in 
the cutoff region. Since no z-component (i.e., longitudinal component) 
exists in the electric field of the basic mode, it is called a transverse electric 
mode or, in short, a TE mode. For n = 1, since the ^-directed field intensity 
has one maximum in the x-direction and none in y, the subscript 1,0 is 
attached, and the mode is identified as the TE 10 mode in the rectangular 
waveguide. This particular mode is sometimes called a dominant mode since 
it can exist when all other modes are in the cutoff condition, and they 
become negligibly small beyond a certain distance from their source of 
excitation. 

The phase constant of the TE 10 mode is given by 

p = {a>hp - (nlafY 11 = Inr 1 {1 - (A/4) 2 } 1 ' 2 (3.14) 

where y = ±jfi and (3.13) are used. The corresponding wavelength in the 
waveguide becomes 

X s = 2xlP = X{l-(Xll c ) 2 }-' 12 (3.15) 

When the free space wavelength approaches X c , X g becomes longer and at 
X = X c , it becomes infinite. Beyond X c , the field ceases to propagate. 

All the nonzero components of the electromagnetic field are obtained 
from (3.12), (3.6) and (3.8). Thus, for the TE 10 mode, we have 

E x = 0 

E y = (Ae~ ipz + Be Jpz ) sm(nxja) 

£ z =0 

H x = - {Zo 1 ) {Ae~ m - Be jpz ) sin(7tx/a) 

H y = 0 

H z = j{nla) (o)n)~ 1 (Ae~ JPz + Be jPz ) cos (nxf a) 
where A and B are constants and 

Z 0 = (hIb) 112 {1 - (AM*) 2 } " 1/2 

Let us investigate the field configuration concentrating on the terms with 
A only, since the terms with B give identical field configurations except that 



(3.16) 



(3.17) 




the direction of the propagation is opposite. The field repeats itself with a 
period of X g in the z-direction. In the x-direction, E y shows a sinusoidal 
variation with its maximum at the center (x = a/2) of the waveguide while 
H x has the same variation except for the opposite sign. On the other hand, 
H z is a cosine function of x with its zero at the center and with a phase 
differing 90° from E y or H x . Therefore, the electric and magnetic field 
configurations must look like those in Figs. 3.3(a) and (b), respectively. 




(a) E (b) H 

Fig. 3.3. Electric and magnetic fields in waveguide. 



The vector H encircles the point where E = 0 or, equivalently, where the 
displacement current e 8E/8t becomes maximum. Figure 3.3 shows the 
patterns over a half wavelength in the z-direction at a fixed instant of time. 
The other half wavelength is identical except for the direction of the arrow- 
heads. Those patterns move in the z-direction with time travelling with a 
phase velocity v p = <«//?. 

Let us next consider the surface current on the waveguide wall, from (3.2). 
This is perpendicular to the magnetic field and hence should look like Fig. 
3.4. If we remember that this pattern moves in the z-direction with time, at a 
fixed point on the wall, the surface current changes its magnitude and direc- 
tion with time. At the center of the broad surface where x = a/2, the trans- 
verse current always remains zero. Therefore, the waveguide can either be 
split in two or a longitudinal slot can be cut along the center line without 
disturbing the inside fields appreciably. 

To excite the TE 10 mode, the center conductor of a coaxial line can be 
extended into the waveguide in the form of an antenna from the broad 
surface while its outer conductor is short-circuited to the wall. A short- 
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Fig. 3.4. Surface current on wall. 



circuiting plunger is usually inserted in the waveguide behind the antenna 
to prevent power going in the other direction. This type of antenna excites a 
number of other modes, but if the TE 10 mode is the only propagating mode, 
this can be launched efficiently by adjusting the size of the antenna as well 
as the position of the plunger. 

A comparison of E y , — H x in (3.16) with the voltage (1.9) and the current 
(1.14) on a transmission line shows that they are identical except for the 
factor sin(nxja). As far as longitudinal variations are concerned, the 
electric and magnetic fields vary exactly in the same way as V and / on a 
transmission line with characteristic impedance Z 0 and phase constant /?. 
Thus, introducing V and /, we can write the fields as follows: 

E y = KV sm(nxja ) (3.18) 

— H x = Kl sin(7rx/a) (3.19) 

where A! is a constant. Since the behavior of V and / can be studied on the 
Smith chart, the variation of E y and - H x in the z-direction can also be inves- 
tigated on the same chart. In the theory of transmission lines, the variation 
of the impedance Z defined by the ratio of V to I is obtaind from the Smith 
chart. The corresponding quantity in the waveguide is the impedance 
defined by 

Z = - (E y IH x ) (3.20) 

The power transmitted in the positive z-direction is given by the real part 
of the integration of Poynting vector over the waveguide cross section, 



|Ex 



H*k dS 



= jE,(-H*)dS 

= | (sin (nx/a)} 2 KK'VI* dS = i abKK'Vl * 
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where k is a unit vector in the z-direction. If K is defined as 

K = y /2(ab)' 112 (3.21) 

then the power is given simply by Re{T/*} which is equal to the trans- 
mission power on the transmission line. It is worth mentioning that (3.21) 
is not the only condition for the power to be expressed as Re {VI*}. Since 
the power relation specifies the magnitude of K only, the phase of K is still 
arbitrary. This phase is set equal to zero to avoid unnecessary complications. 
If V' =nV and /' = //«, where n is a positive constant, are substituted into 
(3.18) and (3.19), a comparison with (3.16) shows that W and /' represent 
the voltage and current on a transmission line with the characteristic 
impedance n 2 Z 0 and the phase constant ft. Furthermore, the transmission 
power in the waveguide is given by Re {VT*} while the ratio of E y to — H x 
is given by Z’/n 2 , where Z' is the impedance defined by the ratio of W 
and /'. Therefore, V' and /' can be adopted equally well to represent the 
variation of the electric and magnetic fields inside the waveguide. It follows 
from this argument that the value of the characteristic impedance of the 
transmission line is not uniquely determined for representing the waveguide. 
In particular, the characteristic impedance n 2 Z 0 can be chosen to be unity 
in which case the difference between the normalized impedance and actual 
impedance disappears. 

Although there is a one-to-one correspondence between the longitudinal 
variations of the transverse electric and magnetic fields of one mode in the 
waveguide and the variations of the voltage and current on the transmission 
line, the phase constant and the characteristic impedance considered above 
change with w quite differently from those of the transmission line discussed 
in Section 1.1. There, Z 0 was independent of co while ft was proportional to 
co. Since p given by (3.14) is not proportional to co, the phase velocity 
Vp = wj p changes with co. Two waves, one at frequency co and the other at 
co + Aco, have different velocities. When they are superposed, the velocity 
of the envelope is, therefore, different from either of their individual phase 
velocities. For simplicity, let us consider a superposition of two waves with 
the same amplitude. It is given by 

A cos (/?z - cot ) + A cos {(/l + Ap) z — (w + Aw) *} 

= 2yl cos {j(Apz — Awt)} cos {(/? + \ AP) z — (w + \ Aw) t } (3.22) 

The first cosine term cos {\(A(1 z — Aw f)} represents the envelope. Repeating 
a discussion similar to the one used for the phase velocity in Section 1.1, 
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the velocity of the envelope is found to be dco/d/1. The last cosine term in 
(3.22) shows that the wave inside the envelope travels with the mean phase 
velocity of the two individual waves. If we illustrate the superposed wave as 
shown in Fig. 3.5, the dotted line representing the envelope moves with a 
velocity different from that of the solid line representing the detail. If we 
watch the space between two adjacent nodes of the envelope, the detailed 
wave may appear to be generated at the left-hand side and to propagate 
toward the right. At the nodes indicated by A t and A 2 in Fig. 3.5, 




Fig. 3.5. Superposition of two waves with frequencies to and co + Jco. 



V and /, and, hence, the electric and magnetic fields are zero. It is, therefore, 
unlikely that electromagnetic energy is transferred in either direction across 
these points. The energy stored between A : and A 2 must travel with the 
same velocity as the envelope and is that carried by the superposition of the 
two waves. In the limit when Aco~* 0, the two frequencies coincide with 
each other, and the velocity of the envelope becomes 



A to dco 
v » = lim 7. = -,-R 

Ao->0 A p up 



(3.23) 



This is called the group velocity. Since the argument about the velocity of 
energy holds regardless of the value of Ato, it is considered to be valid in the 
limit when Ato-*0. Hence, the group velocity is considered to be the velocity 
of energy carried by the electromagnetic wave at frequency to. 

Note that the above argument holds only when the system is lossless; if 
it is lossy, nodes like the ones shown in Fig. 3.5 may not exist. Even if they 
do exist, as Ato approaches zero, the distance between nodes becomes long 
and the wave amplitude decreases with distance. In the limiting case in 
which Ato-> 0, it disappears before reaching the next node. Therefore, the 
above conclusion of energy velocity being equal to the group velocity can no 
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longer be reached. In other words, the group velocity defined by (3.23) 
generally does not represent the velocity of energy in lossy cases. 

For the present waveguide, we have from (3.14) 

= (®/« = n a - (w 2 r 1/2 = a - (w 2 } - 1/2 (3.24) 

V9 ^% = (<E) = 1 fa 2 ®** “ W fl ) 2 } 1/2 

= --=~ = ® 0 {l ~(W 2 } 1/2 (3.25) 

v p ep v p 

where 

»o =M " l/2 

As the free space wavelength X approaches X c , the phase velocity v p increases 
indefinitely. On the other hand, the group velocity v g always stays below v 0 
confirming a result of the theory of relativity: The velocity of substance or 
energy never exceeds the light velocity. From (3.24) and (3.25), we obtain an 
interesting relation between v p and v g , 

= V 0 2 

This relation holds for lossless waveguides filled with homogeneous media; 
however, one should not jump to a quick conclusion that the product of 
phase and group velocities is always a constant. Let us assume the product 
is constant, then we obtain the relation 

v p v g = (to IP) (dcoldfi) = const 

which gives the solution fi= ±(c,w 2 + c 2 ) 1/2 where c x and c 2 are arbitrary 
constants. The ft given by (3.14) corresponds to this expression with c y =ep 
and c 2 — —(n/a) 2 . This result is generally not true; for example, when the 
waveguide is lossy, and hence the product of v p and v g is not necessarily a 
constant. 

Finally, let us consider briefly the effect of discontinuities in the waveguide. 
If there is a window in the waveguide as shown in Fig. 3.6, the tangential 
electric field must vanish on the conducting walls indicated by the shaded 
area. This boundary condition is not generally satisfied by the electro- 
magnetic field given in (3.16). Therefore, a number of other modes are 
generated to satisfy it when all are added together. Since these additional 
modes are usually in the cutoff region, their amplitudes decrease with 
distance away from the window, and beyond a certain distance d, all the 
modes except the TE, 0 mode practically disappear. Let A and B in Fig. 3.7 
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(a) 



A B 




Fig. 3.7. Explanation of equivalent circuit of window (a) Waveguide top view; 
(b) Equivalent circuit. 





be reference planes located as shown. Then as far as the region outside of 
A and B is concerned, the waveguide on each side can be represented by a 
transmission line corresponding to the TE 10 mode. Since both Maxwell’s 
equations and the boundary conditions are linear, there should be a linear 
relation between the E ' s and J7’s at A and B and, hence, between the V’s 
and /’ s at the corresponding points on the transmission lines. This is express- 
ed by 

Vi = Zn^ + Z 12 I 2 , V 2 = Z 12 I l +Z 22 I 2 
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where V t and V 2 are the voltages at A and B, respectively, and / x and I 2 
are the corresponding currents. Thus, each discontinuity in the waveguide 
can be treated as a two-port network inserted in the corresponding trans- 
mission line. The coefficient of I 2 in the first equation and that of / x in 
the second equation are the same if the circuit is reciprocal. This point will 
be explained in Section 5.2 in detail. 

When a thin window is placed perpendicular to the waveguide axis as 
shown in Fig. 3.6, it is known that the two-port network is equivalent to a 
length of transmission line with the same electrical length as the waveguide 
between A and B, shunted by a reactance at the position of the window, as 
shown in Fig. 3.8. Furthermore, if the window has the shape shown in the 




i i 



Fig. 3.8. Equivalent circuit of inductive window (a) Waveguide top view; (b) Corre- 
sponding equivalent circuit. 

figure, the reactance is inductive. This may become plausible if we consider 
that the window introduces excess conduction current on the wall and, hence, 
increases the magnetic energy stored in its vicinity. A more detailed dis- 
cussion will be given in Section 3.7. It is worth noting that the above equiva- 
lent circuit is valid only when the effect of the window on the region outside 
A and B is considered. If the equivalent circuit is used to discuss the region 
between A and B, wrong conclusions may be reached since the cutoff modes 
may not be adequately attenuated. For instance, suppose that an equivalent 
circuit of two inductive windows closely spaced to each other is expressed 
by two inductances inserted across the transmission line as shown in Fig. 3.9. 
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The value of each inductance is not generally equal to the value of the induc- 
tance for the same window situated an appreciable distance from other 
possible discontinuities. This is due to various interactions between the 
cutoff modes excited by the two windows, which do not exist in the case of a 
well-separated window. 

The parameters of the two-port network Z u , Z 12 , and ^22 can be 
obtained by measuring the input impedance at A with various loads 
connected to B. The input impedance at A can be determined from a Smith 
chart, once the standing wave along the transmission line, and hence the 
relative magnitude of E y in the waveguide has been measured as a function 




l 




Fig. 3.9. Equivalent circuit of closely located inductive windows (a) Waveguide top 
view; (b) Corresponding equivalent circuit. 

of z. This measurement is generally done using a movable probe inserted 
vertically into the waveguide through a longitudinal slot cut along the center 
line (x = ajl ) of the broad surface. Since the slot does not interrupt the wall 
current as explained before, the TE 10 mode disturbance is negligible. The 
probe couples to E y only, and the output indicates its relative magnitude 
when detected and measured on a meter. From this, the standing wave ratio 
and the voltage minimum point can be determined, which are sufficient to 
obtain the impedance. A slotted waveguide with a movable probe for this 
purpose is called a standing wave detector. 

The impedances required at B for the above measurement are provided 
by a movable short and a matched load. The former consists of a short- 
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circuiting metal plunger in the waveguide which completely reflects the 
incident wave. The position of the plunger and, hence, the phase of the 
reflected wave are adjustable, giving any impedance on the periphery of the 
Smith chart. The matched load usually consists of a resistive card longitu- 
dinally placed in the waveguide which completely absorbs the incident 
power, thus giving an impedance corresponding to the center of the Smith 
chart. The input end of the card is tapered to a point in order to eliminate 
reflections which might be caused by a sudden change in line properties. 
This technique of gradually changing line properties is often used to reduce 
reflections at discontinuities. That part of the line introduced for this purpose 
is called a tapered section. 

3.3 Introductory Eigenvalue Problem 

In the previous section, the problem of finding possible fields in the wave- 
guide was reduced to that of solving the differential Eq. (3.10) under the 
boundary conditions (3.11). The constant k 2 = y 2 + oj 2 eh had to be deter- 
mined in order to make the solution satisfy the boundary conditions at 
x = 0 and x — a. As illustrated by that example, the problem of solving a 
differential equation that has a constant to be determined under appropriate 
boundary conditions is called an eigenvalue problem. The solutions are 
called the eigenfunctions and the corresponding constant for each eigen- 
functions is the eigenvalue. We know by substitution that the eigenfunctions 
and eigenvalues of (3.10) and (3.11) are given by 

Eyn = A„ sin (mix fa), k 2 = y 2 + a> 2 efi = ( nnja ) 2 

respectively. Since eigenvalue problems play an important role in the theory 
of waveguides, let us reconsider the same problem using a more general 
approach. 

The k 2 ' s take discrete positive values in the above solutions, however, it 
is not clear whether or not other solutions exist in which the k„ 2 ' s are com- 
plex numbers. To investigate this point let E yn be an eigenfunction of (3.10) 
and (3.11). By definition, we have 

d 2 E yn , 

+ k 2 E yn = 0 (3.26) 

E yn = 0 (x = 0, x = a) (3.27) 

Since E yn may be a complex function, multiplying (3.26) by its conjugate, we 
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obtain 



E * d E yn k 2 e* E = — 

Eyn + Kn E yn t yn ^ 



+ = 0 

dx dx 



Integrating this with respect to x from 0 to a, we have 



£ * d l>» 

yn dx 



H 



dx + k. 



dx = 0 



where the limits of the integrals are from 0 to a , and [ ] 0 a indicates the differ- 
ence of values for the inside function at x = 0 and x = a, e.g., [/(x)] 0 ° = 
f(a) _/( 0). Since E yn is equal to zero at x = 0 and x = a, its conjugate is 
also zero at these two points, and the first term on the left hand side vanishes 
giving a simple expression for k 2 ; 

f \dEJdx\ 2 dx 



\E„\ 2 dx 



This expression shows that for a nonzero solution E y „, k 2 can take neither 
negative nor complex values, which is the point we wanted to check. Now 
suppose that E yn is a complex function, then since k 2 is real, the real and 
imaginary parts of E yn satisfy (3.26), separately. Therefore, each E yn can be 
considered as a real function without loss of generality. 

To see why the eigenvalues are discrete, let us next investigate (3.26) and 
(3.27) with the restriction that E y is real and k 2 positive. To satisfy the 
boundary condition at x = 0, E y starts from the origin as shown in Fig. 3.10. 
The solid and dotted lines illustrate the cases in which E y starts into the 
positive or negative direction, respectively at x = 0. Since E y obeys the differ- 
ential equation which can be rewritten in the form 



I d 2 E y 
E y dx 2 



= -k 2 



when E y is positive, d 2 E y ldx 2 must be negative, i.e., dEjdx must decrease 
with increasing x. On the other hand, when E y is negative, d E y ldx must 
be positive and dEjdx increases with increasing x. Therefore, E y must be a 
convex function of x which tends to return to the base line E y = 0. If k 2 is 
small, this tendency to return to the base line is weak resulting in a nonzero 
value of E y at x = a, as shown in Fig. 3.10(a). As k 2 increases, the curvature 
increases and a value of k 2 must exist which causes E y to become exactly zero 
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Fig. 3.10. Explanation of the existance of discrete eigenvalues. 

at x = a, as shown in Fig. 3. 10(b). This value of k 2 gives the first eigenvalue 
k 2 . The corresponding function E y is the first eigenfunction E yl . If k 2 
increases further, E y crosses the baseline and overswings at x = a, as shown 
in Fig. 3.10(c). Therefore, such a value of k 2 cannot be an eigenvalue since 
the boundary condition is not satisfied, but when k 2 increases further, another 
value occurs which causes E y to return to the baseline exactly at x = a. 
This is shown in Fig. 3.10(d). The corresponding k 2 and E y are the second 
eigenvalue k 2 2 and eigenfunction Ey2> respectively. In this way, k 2 can take 
only discrete values for E y to satisfy the differential equation and the 
boundary conditions. 

There is an important relation known as the orthogonality between any 
two different eigenfunctions having different eigenvalues and is given by 

J E yn E ym dx = 0 (n^m) 



(3.29) 
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where the integral is from x = 0 to x = a. The word orthogonality is employed 
here because of a similarity between (3.29) and the orthogonal relation 
between two vectors, 

A • B = A X B X + A y By •+■ A Z B Z — 0 

A function defined over a finite interval can be represented conceptionally 
by a vector in an infinite dimensional abstract space whose components give 
the values of the function at each point in the interval. The scalar product 
between two such vectors can be expressed as the infinite summation of the 
products of corresponding components, which is proportional to the integral 
of the product of the corresponding functions. Therefore, the orthogonal 
relation is given by equating the integral to zero which is essentially done 
in (3.29). There is another reason for this terminology related to power 
which we shall discuss shortly. 

The proof for (3.29) is as follows. Since E ym is an eigenfunction, it satisfies 

d 2 E vm , 

-T-jp + k * E v» = 0 (3-30) 



Multiplying (3.26) and (3.30) by E ym and E yn , respectively, and integrating 
their difference from x = 0 to a, we have 

j { E ’ m IP? - Eyn d ^ n ) dX +{k " 2 ~ k " 2) J E ” E ’ m dX = 0 (3 31) 

The first integral on the left-hand side can be transformed to 




dE ym dE yn ) f d / dE ym \ dE yn dE ym ) 

dx~ dx J (rfx^ yn dx ) dx dx J 




Since both E yn and E ym become zero at x = 0 and x — a, this is equal to zero 
and (3.31) reduces to 




E yn E ym 



dx = 0 



(3.32) 



{k 2 — k m 2 ) does not vanish by hypothesis, and hence (3.32) is equivalent 
to the orthogonality relation (3.29) which we wished to prove. 

In our case, the eigenfunctions are sinsoidal and (3.29) corresponds to a 



‘‘A**** 
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well-known formula 



sin (rnix/a) sin (mux/a) dx — 0 (n ^ m) (3.33) 

Jo 

This can be proved without relying on the above derivation as follows: 

f . nnx . mnx f f (n — m ) nx (n + m) 7rx) 

2 sin — sin dx = <cos cos > dx 

J a a J [ a a J 

[ a . (n — m) nx~f f a (n + m) nx~\ a 

— sin^ — - ; ; " sin =0 

(n - m) n a J 0 l(n + m) n a J 0 



In order to see the physical meaning of the orthogonality, let us calculate 
transmission power due to the superposition of two modes, n and m. From 
(3.6), the value of H x corresponding to E y = A„E yn is given by 

H x = -y n {j(onY l A n E yn 

Similarly, when E y = A m E ym then H x = - A m E ym . The super- 

position of these two modes gives 

E y = ( A„E yn + A m E ym ) , H x = - ( jeon ) “ 1 (A„y„E yn + A m y m E ym ) 

The power in the z-direction is given by the integral of the Poynting vector 
over the cross section of the waveguide, i.e.. 



Re Je x H* • k dS = Re JJ E y {- H x f dx dy 

= Re J J ( A n E yn + A m E ym ) (- 1 (A*y*Ey„ + A m *y m * E ym ) dx dy 

= Re |( — j(Dfi )~ 1 JJ (| Af y„*Ey n + A n A m *y m *Ey n E ym 



+ A m A*y*E ym E y „ + \AJ 2 y m *E ym ) dx 



Since the integral of E y „E ym with respect to x vanishes from the orthogonality 
relation, the above expression reduces to 

Re | (-ja>n)-' JJ (|X„| 2 y„*Ey n + \A m \ 2 ?m *£ 2 „) dx = P„ + P m 

where P„ and P m are the transmission powers of the individual modes n and 
m, respectively. From this, we conclude that when the orthogonality 
relation (3.29) holds, the total transmission power is equal to the summation 
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of the powers transmitted separately by each mode. Referring to the dis- 
cussion on the superposition of powers in Section 2.3, the above two modes 
n and m can be said to be orthogonal to each other, which is another reason 
for calling (3.29) the orthogonality relation. 

Let us introduce an expression k 2 (E y ) given by 

f 0 iEJdxf dx - 2 [£,(</£,/</*)]? 

e{E y ) = J - (3.34) 

I E y 2 dx 

where E y is restricted to be real and the integrals apply to the interval from 
r = 0 to x = a. For k 2 (E y ), the value is specified when a function of x, E y , 
is given while the value of a function of x is specified when x is given. In 
general, a quantity whose value is fixed when a function is given is called 
a functional. Thus, k 2 (E y ) is a functional of E y . 

Let k 2 and k 2 + Ak 2 be values of k 2 (E y ) corresponding to E y and 
E y -l- AE y , respectively, where AE y is a small change from E y . Since k 2 + Ak 2 
is given by the right-hand side of (3.34) when E y is replaced everywhere by 
E y + AE y , we have 

{k 2 + Ak 2 ) J (E y + AE y ) 2 dx = | {d(E y + AE y )ldx} 2 dx 

-2[_(E y + AE y ){d(E y + AEy)ldx}J 0 

where both sides have been multiplied by the denominator. Neglecting 
higher order terms of the small quantities Ak 2 and AE y , the above equation 
reduces to 




(3.35) 



It is worth mentioning that d AE y jdx is also assumed to be small and its 
higher order terms are neglected in the above reduction. This assumption 
imposes a rather stringent requirement on possible AE y s; namely, when 
AE y is said to be small in the following discussion, it means that not only 
the value of AE y itself but also its derivative is small. 
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The first term on the left hand side of (3.35) cancels with the first and the 
third terms on the right-hand side because of (3.34). The second term on the 
right-hand side can be rewritten in the form 




Therefore, (3.35) becomes 



2 J AE y + k 2 E^j dx + 2 ^E y + Ak 2 J E 2 dx = 0 (3.36) 

If E y happens to be one of the eigenfunctions, say E yn , the value of the 
expression k 2 (E y ) given by (3.34) is equal to the corresponding eigenvalue 
k 2 as we can see from (3.27) and (3.28). In this case, since E y „ satisfies both 
the differential equation and the boundary conditions, the first and second 
terms on the left-hand side of (3.36) vanish, and the first order variation 
Ak 2 of k\E y ) due to a small variation AE y from E yn is thus equal to zero. 
That is, if a trial function E y which is slightly different from a true eigen- 
function is inserted in the expression k 2 ( E y ), it gives an approximate value 
for the eigenvalue which is accurate up to the first order term of the difference 
AE r Conversely, if the first order variation Ak 2 is kept equal to zero for any 
small variation AE y from a function E y , then this function E y is an eigen- 
function for the following reason. Suppose (3.26) is not satisfied somewhere 
in the region under consideration, then AE y can be chosen in such a way 
that it has the same sign as that of d 2 E y jdx 2 + k 2 E y everywhere, provided 
the latter is not equal to zero. At the same time, d AE y jdx can be chosen 
to be equal to zero at jc = 0 and x=a. Therefore, for this AE y , the first 
term on the left-hand side of (3.36) becomes positive, the second term is 
zero and Ak 2 cannot be equal to zero. If, on the other hand, the differential 
equation is satisfied but the boundary condition at x = 0 or x = a is not 
satisfied, the sign of d AE y \dx can be made the same as that of E y at the 
boundary, giving nonzero Ak 2 again. 

It follows from the above discussion that the following two statements 
are equivalent: (a) The first order variation Ak 2 of k 2 (E y ) is equal to zero 
for arbitrary AE y s whose values and derivatives are small ; and (b) E y is an 
eigenfunction. In other words, the eigenvalue problem is equivalent to that 
of finding E y s which give stationary values of k 2 (E y ) with respect to small 
variations in E y . Such a functional k 2 (E y ) is called the variational expression 
for the eigenvalues. 
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Since k 2 (E y )>0 for all functions satisfying the boundary conditions 
(3.27), there must be a function which minimizes k 2 (E y ). Let E ,i be this 
function, then, for any small variation AE y satisfying the same boundary 
conditions, the first order variation Ak 2 from k 2 (E yi ) is equal to zero. 
Since the second and third terms on the left-hand side of (3.36) are zero, 
E yl must satisfy the differential equation. Otherwise, if the first term is made 
positive by choosing the sign of AE y to be the same as that of (d 2 E yl jdx 2 ) + 
-1 -k 2 E yl everywhere, then a contradiction is obtained. From this, it follows 
that E yl thus chosen is an eigenfunction. Next, let Eyi be a function which 
satisfies the boundary conditions and at the same time minimizes k 2 (E y ) 
with the additional condition that it is orthogonal to Eyi . Then, for any small 
AE y satisfying the boundary conditions, Ak 2 is again found to be zero as 
follows. Since an arbitrary function F can be written in the form 



FE yl dx 

F = E yl J -~ + 

J Ell dx 

where the first term is proportional to Eyl and the second term is orthogonal 
to Eyi, AE y can be decomposed into two parts, one proportional to E yl 
and the other orthogonal. From the definition of Eyl, it is obvious that Ak 2 
corresponding to that part of AE y orthogonal to E yl always vanishes. Let us, 
therefore, consider Ak 2 corresponding to the part of AE y proportional to 
Eyi. We have 



F — E 



J f - 

7 



E yJ dx 



Eyl dx 






+ k 2 E y2 ) dx 



d 

dx 



Eyl 



dE 



y 2 



dx 



- E 



y 2 



dEyl 

dx 



= \E 



dE 



y 2 



>i 



dx 



- E 



y2 



dx 



I 



+ E 



y2 



d z E 



yi 



+ k 2 E yX 



+ (k 2 2 -ki 2 ) 



dx 2 

E yl E y2 dx 



dx 



in which the first and second terms on the right-hand side disappear because 
of the boundary conditions for Eyl and Eyl and the orthogonality relation, re- 
spectively. Consequently, the first term on the left-hand side of (3.36) becomes 
zero for AE y proportional to E yl . The second term is zero from the boundary 
conditions for E y2 , thus leading to the conclusions that Ak 2 , corresponding 
to the part of AE y proportional to E yl , is equal to zero. Therefore, Ak 2 = 0 



*■ 





for any small AE y satisfying the boundary conditions. Applying the same 
argument used for Eyl, Ey2 is found to be another eigenfunction. Similarly, 
let £j, 3 be a function which satisfies the boundary conditions and minimizes 
k 2 (E y ) under two additional conditions of being orthogonal to E yl and 
Eyl' An argument similar to the above shows that E y 3 is also an eigen- 
function. In this way, adding the orthogonality conditions one by one, one 
can find an infinite series of eigenfunctions Eyl , Eyl, ••• with the corre- 
sponding eigenvalues satisfying 0 < k 2 ^ k 2 < ..., and k 2 thus obtained 
increases indefinitely with n, i»e., 

lim k 2 — oo 

n-* oo 

A proof as to why k 2 increases with n which can be generalized to two 
and three dimensional cases will be given in Appendix I. 

If we now assume the infinite growth of the eigenvalues, one of the most 
important properties of the set of eigenfunctions, called the completeness, 
can be derived as follows. Since an eigenfunction multiplied by a constant 
is still an eigenfunction with the same eigenvalue, let us first normalize every 
eigenfunction by multiplying with a proper constant, i.e., for every E yn , 
the relation 

J Ey, dx = 1 ( 3 . 37 ) 

is assumed to hold. In our case, E yn = yJ 2 a~ 1/2 sin(«7tjc/a) is the appropriate 
form of normalized eigenfunctions. For convenience, let us call a function 
piecewise-continuous when it is continuous except at a finite number of 
points in the domain of interest, i.e., in the interval from x = 0 to x = a 
in the present case. We shall also call it square-integrable when the integral 
of its square exists over the same domain. Let a function / satisfying the 
boundary conditions have a square-integrable derivative, but otherwise, be 
arbitrary. We define f N by 

/» = /-! A,E y „ ( 3 . 38 ) 

n~ 1 

where 

A, = jf-E„dx ( 3 . 39 ) 

Also, let a N 2 be given by 

“ » 2 = J Sh dx ( 3 . 40 ) 
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From (3.38) and (3.39), using the orthogonality and normalization condi- 
tions, we have 

«/ = j(/-lA„EJ 2 d x 

= | (/ 2 - 2 £ + Z dx = jf 2 dx-^\ 2 (3.41) 

From the definitions of f N and a N , we obtain 

f (/„/a„) 2 dx = 1 (3.42) 

and 

J (/w/«w) £ >n dx = 0 (ji < N) (3.43) 

Since f N /a N is orthogonal to E yn (n < AT) and satisfies the boundary conditions, 
k 2 (f N /a N ) cannot be less than k N 2 . This is due to the fact that E yN minimizes 
k 2 (E y ) to k N 2 while at the same time it satisfies the boundary conditions 
and is orthogonal to each E yn for which n<N. The denominator of k 2 (f N ja N ) 
is unity from (3.42), and hence we have 




where the boundary conditions, (3.39) and the orthogonality conditions are 
used. Noting that the second term on the lefthand side of the inequality is 



positive, we have 




On 2 | ( dffdx ) 2 
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or equivalently, 

a N 2 ^ 2 J (df I dx ) 2 dx 

It follows from this that 

lim a N 2 = 0 

N-* oo 

since J {dffdx) 2 dx has a finite value and k N 2 increases indefinitely with N. 
From the definition of a N , the above equation is equivalent to 

lim f ( 7 - Y A n E yn ) 2 dx = 0 (3.45) 

This means that an arbitrary function / which satisfies the boundary condi- 
tions and has a square-integrable derivative can be expanded in terms of the 
E yn ' s which implies that the integral of the square of the difference between 
/ and Yj A n E yn converges to zero. 

Let F be an arbitrary function which is square integrable and piece-wise- 
continuons. Then Fean be approximated by a function /which satisfies the 
boundary conditions and whose derivative is square-integrable, as shown 
in Fig. 3.11, so as to satisfy 

J (F - ff dx < s/4 

where e is a fixed, but arbitrarily small, positive value. On the other hand, 
from (3.45) 

J (f ~ E dx < e/4 





Fig. 3.11. Approximation of a discontinuous function by a continuous function with 
square-integrable derivative. 
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for a sufficiently large N. Therefore, we have 

J (f - "Z - /+ /- ! dx 

«2j(F-/) J dx + 2 jif-^A^fdx^e 

where use is made of the relation (2) in Appendix I. Since e is arbitrary, the 
above relation shows that a piecewise continuous and square-integrable, but 



otherwise arbitrary, function Fean be expanded in terms of the E yn ' s: 



lim I - (f- £A^Sdx = 0 

JV-co J \ n= 1 / 



(3.46) 



This property of the E y „’s is called the completeness. The above relation is 
usually written in the form 



F = Z AE, 

n = 1 



where 



'■-r. 



F • E yn dx 



(3.47) 



(3.48) 



The reason that (3.39) can be rewritten in the form of (3.48) is easily seen 
from the following: 

| J F-E y , dx - J /•£„„ = | J (f - /)•£„ dx j" 

« J (F- f) 2 dx^ El 4 

where use is made of (1) in Appendix I and the normalization condition 



for E yn . 



In the present case of the eigenvalue problem under consideration (3.26) 
and (3.27), the E yn ' s are sinusoidal functions and (3.47) is the Fourier 
expansion of F : 



where 



F = X 1/2 sin (nnx/a) 



A n = J F'yjla 1/2 sin (nnxja)dx 



Although we wrote (3.47) in the form of an ordinary equation, the real 
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meaning of the equality is given by (3.46) in which the equality does not 
mean that the value of the left-hand side of (3.47) is equal to the value of the 
right at each point in the domain of interest. For instance, it is well known 
that the Fourier expansion gives the mean value at each discontinuity point. 
This restriction does not bother us, however, since any measured value of a 
physical quantity at a point is actually the average over a certain region in the 
vicinity of that point. It is immaterial what value is given by the functions 
at each point as long as the average value over a neighborhood stays the 
same. In this sense, both sides of (3.47) are equal. 

With this understanding of the meaning of the equality, let us next study 
how to use it. Let the expansion of F and G be given by £ A„E yn and 
£ B„E y „, respectively. If F and G are equal in the sense that J (F— G ) 2 dx = 0, 
then A„ = B„ for each n. The proof is as follows. For a given £ no matter how 
small it may be, the following inequality can be satisfied with a sufficient 
large A where Lemma 2 in Appendix I is used twice for the second inequality. 

* /N- 1 N - 1 \ 2 

Z AE,„- Z B.E„) dx 

J \n = l n = 1 / 

=£ j(F-G + Y,A n E„-F + G-Y,BJ- y „Ydx 

<2 j(F-G) 2 dx + 4 j(F-Y,A„E y „) 2 dx + 4j (G - Z B„E y „) 2 dx 

< £ 

However, the left-hand side is equal to YJn=i (4„ — B„) 2 because of the 
orthogonality between the E yn ’s. It follows from this that A„ = B„ for each n, 
otherwise £ cannot be arbitrarily small. Conversely, if A„ = B n for every n, 

j(F-G) 2 dx = j(F- Z A„E yn + £ B„E,„ - G) 2 dx 

« 2 J (F ~ Z AE„) 2 dx + 2 J (G - X B,E„) 2 dx 

< 8 

Therefore F=G in the sense that \(F—G) 2 dx = 0. The same conclusion 
can be obtained by a slightly less rigorous but more popular method. 
Disregarding the real meaning of the equality between series expansions, 
first equate the two expansion forms as ]T A„E yn = £ B„E yn , next multiply 
both sides by E yn and integrate with respect to x over the domain of interest 
from x = 0tox = a, We are then left with only one term, A„ = B„ , because 
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of the orthogonality of the E yn 's. Conversely, if A„ — B„ for every n, obviously 
Z»=i A nEyn = 2?= i S n^ yn for all N, which means F = G. 

Another interesting formula is given by 

f F G dx = f A,B, (3.49) 

J n= 1 

To prove this, let us calculate the square of 

J (f - Z 1 A£„) (g - Y dx 

The result is 

{ J (F - 1 A * E J (G - E BJ y rfxj 2 

= {jfGdx-I^„J G-E„ dx-ZB'f F‘E yn dx + £ A„B,J 

= |J F-Gdx-YA n B^ 

However, because of Lemma 1 in Appendix I, the left-hand side is smaller 
than 

which can be made arbitrarily small. Thus, for a given e > 0, no matter how 
small it may be, 

{J FGdx ~ y"” b ”F <<! 

This inequality can be satisfied by making N sufficiently large, which is 
equivalent to the desired formula (3.49). In particular, when F= G, we have 

I* co 

F 2 dx= £ A » 2 

J n— l 

The same conclusion can be obtained by the following less rigorous method: 
( FGdx= f f A n E yn -l B n E yn dx= £ A„B n 

J J B=1 B=1 

where the orthogonality and normalization conditions between the E yn *s 
are used. 

Finally, it is worth mentioning that, when Fis complex, the real and imag- 
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inary parts can be expanded separately and then added together to obtain 
exactly the same formula as (3.47) and (3.48). In this case, (3.46) is replaced 

by ^ I TV — 1 2 

lim F - X A n E yn dx = 0 

n-*co J | n= 1 

Similarly, (3.49) holds even if F and G are complex. 

3.4 Eigenfunctions for Waveguides 

In Section 3.2, we investigated properties of electromagnetic waves in 
rectangular waveguides, assuming that the fields varied exponentially in the 
longitudinal direction and that E y was the only nonzero component of the 
electric field. In this section, eliminating the second assumption, we shall 
set up an eigenvalue problem for the transverse electric field in a straight 
waveguide with arbitrary cross section. The solutions of this problem play 
an important role in the theory of waveguides to be discussed in the next 
section. 

Because of the particular geometry of a waveguide extended in one 
direction, for example in the z-direction, and limited in other directions, it 
is convenient to first separate the fields into their longitudinal and trans- 
verse components : 

E = (E, + k E z )e~ yz (3.50) 

H = (H t + k H z )e' iz (3.51) 

where the exponential variation with respect to z is written explicitly so that 
E„ H„ E z , and H z become independent of z. Vector k is a unit vector in the 
z direction while E { , H, are the transverse components of the electric and 
magnetic fields, respectively. 

Let us substitute (3.50) and (3.51) into Maxwell’s equations 

V x H = jcoeE (3.52) 

V x E = — j'm/iH (3.53) 

where o is assumed to be zero. First, substituting into (3.52), we have 

Vx(H, + k H x ) e~ yz = j( 0 £ (E t + k E s ) e~ yz (3.54) 

The left-hand side of (3.54) consists of two terms, 

V x H t e yz = e- yz VxH, + \e~ yz x H t 
= e" yz V x H, - kye -yz x H f 



(3.55) 
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V x k H z e~ yz = e' yz \H Z x k + \e~ yz x k H z 

= - e~ yz k x \H Z - kye~ yz x k H z (3.56) 

where the ordinary rule for differentiating a product of functions is used. 

The second term on the right-hand side of (3.55) and the first term on the 
right-hand side of (3.56) are both perpendicular to k. The second term on the 
right-hand side of (3.56) is equal to zero because of its form k x k. In order 
to find the direction of V x H ( , let us consider 

kx(VxH,) = V (k • H ( ) - (k • V) H ( 

Since H, has no k component, the first term on the right-hand side is equal 
to zero. Because H, is independent of z and k-V = djdz, the second term 
also vanishes. Thus, k x (V x H t ) is found to be zero from which we can 
conclude that VxH, is parallel to k and the first term on the right-hand 
side of (3.55) gives the k-component only. Equation (3.54) can now be 
decomposed into two equations, one for the k-component and the other 
for the transverse component: 

V x H r = jcoekE z (3.57) 

yk x H, + k x \H Z = — jcoe E, (3.58) 

Similarly, from (3.53) we have two equations 

VxE, = — j(onkH z (3.59) 

yk x E, + k x VE Z = j(ofH t (3.60) 

Next, applying V* to (3.52), we obtain 

= 0 

since V*Vx is always equal to zero. A substitution of (3.50) into this 
equation gives 

(V • E, + V . k E z ) + (E, + k£ 2 ) ♦ \e~ yz = 0 | 

Since E z is independent of z and V*k = djdz, V*k E z is therefore equal to 
zero. In addition \e~ yz is equal to k( — y) e~ yz , and since E, is perpendicular 
to k, E f ‘Ve -yz also vanishes. Thus, we are left with 

V-E t = yE z (3.61) I 

-ik 

Similarly, we have from V* (3.53) i 
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We shall now try to obtain an equation for E, alone by eliminating H f , H z , 
and E z from the above equations. First, calculate V x (3.59): 

V x V x E, = - ju>n V x kH z 
= jcofik x \H Z 

Using (3.58), the above equation is shown to be equivalent to 

V x V x E, - (o 2 en E, = — jcopyk x H t (3.63) 

Subtracting V (3.61) from (3.63), we have 

V x V x E, - VV • E, - 0 ) z £fiE t = - jmnyk x H t - y \E Z (3.64) 

However, yk x (3.60) gives 

y 2 k xkxE, + ykxkx V£. = jconyk x H t 

which can be rewritten as follows using (2.14) and the fact that E, and V£ z 
are both perpendicular to k. 

- y 2 E t — y V£ z = jtofiy k x H ( 

Substituting this into (3.64), the desired equation for E ( is obtained: 

V x V x E f - VV-E f -/c 2 E f = 0 (in 5) (3.65) 

where S is the waveguide cross section and 

k 2 = (q 2 £H + y 2 (3.66) 

There are two boundary conditions: On the waveguide wall, the tangential 
component of E, must vanish, i.e., n x E r = 0, and, in addition, E z has to 
be zero, i.e., V»E f = 0 from (3.61). Thus, the problem is reduced to an 
eigenvalue problem of solving (3.65) under the boundary conditions 

nxE, = 0, V • E { = 0 (onL) (3.67) 

where L indicates the waveguide wall and n is the outer normal unit vector. 
Once E t is obtained, E z can be calculated from (3.61), H z from (3.59), and 
H t from (3.60). 

Following the procedure employed in the previous section, let us now 
study the eigenvalue problem for E t . We shall prove that the eigenvalues 
are real and nonnegative (zero or positive). Multiplying 

V x V x E ln — VV • E m — k„ 2 E tn = 0 



V • H, = yH z 



(3.62) 



(3.68) 
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by E* • and integrating the result over S , we have 
J {£* • V x V x E,„ - E* • VV • E,„ - k n 2 Ej • E,„} dS 

= J {(V x E*) • (V x E,„) + ( V • E*) (V • E,„) - k 2 E* ■ E,„} dS 



where the formula for integration by parts and the boundary conditions are 
used. From the above equation, we obtain 



J (|V x EJ 2 + |V-E 1Jt | 2 } dS 
J |EJ 2 dS 



which indicates that k„ 2 is real and nonnegative. Since k„ 2 is real, the same 
argument used in Section 3.3 shows that all the eigenfunction E fn ’s can be 
assumed to be real without the loss of generality. In order to show the 
orthogonality between the E f(1 ’s, let us next multiply (3.68) by E (ni - and 
subtract this same expression with the subscripts m and n interchanged. 
If the result is integrated over S and the formula for integrating by parts is 
used together with the boundary conditions, we obtain 



{K 2 ~ kj) J 



E,„ • E tm dS = 0 



It follows from this that the orthogonality relation 



j* E m • E fm dS = 0 {n±m) 



(3.70) 



holds between any two eigenfunctions with different eigenvalues. 

Inspection of the variational expression (3.34) and a comparison of it 
with (3.28) and (3.69) suggests that 

f {(V x E,) 2 + (V -E t ) 2 } dS - 2 (j) n x E, • V x E, dl 

k Z (E f ) = ? 1 (3.7D 

E, 2 dS 



may be an appropriate variational expression for the present eigenvalue 
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problem, where j dl indicates the contour integral along the boundary L. 
To verify this, let bk 2 be a first order variation corresponding to <5E t , a small 
variation from E,. When <5E, is said to be small, it means that in addition 
to the magnitude of bE„ the magnitude of the derivatives V «<5E, and V x <5E, 
are assumed to be small as before. Neglecting the higher order terms, we 
have from (3.71) 

k 2 j E, 2 dS + 2 k 2 J E, • <5E, dS + bk 2 J E, 2 dS 
= J {(V x E,) 2 + (V *E,) 2 } dS + 2 J {V x E, • V x <5E { + V *E,V • <5E,} dS 

- 2 j) (n x E t • V x E t + n x <5E ( • V x E f + n x E, • V x <5EJ dl (3.72) 

Using (3.71) and the formula for integration by parts, (3.72) reduces to 

bk 2 J E, 2 dS = 2 J bE t *(V x V x E, - VV -E, - k%) dS 

+ 2 (j) {(n • 5E,) (V • E f ) - n x E t • V x <5E,} dl (3.73) 

It follows from (3.73) that if E, is an eigenfunction, the first order variation 
5k 2 corresponding to any small <5E t vanishes. Conversely, if bk 2 is equal to 
zero for every possible small variation <5E r from E ( , E, is an eigenfunction 
for the following reasons. 

(i) If the differential equation is not satisfied somewhere in S, 5E, can 
be made parallel to V x V x E, - VV-E, - Ar 2 E t and both n-<5E, and V x bE, 
can be made equal to zero on L. Why V x <5E, can be made zero when n • <5E, = 0 
will be seen if we write V x (5E t in the form i z ((3 bEJdn^d SEJd,)}, where 
bE n and bE t are the normal and tangential components of <5E„ respectively. 

(ii) If the differential equation is satisfied but the boundary condition 
V*E, = 0 is not, then n-5E, can be made to have the same sign as V*E t 
along L and V x <5E, can be made to vanish on L. 

(iii) Tf the differential equation and the boundary condition V -E, = 0 
are satisfied but n x E, = 0 is not, then V x <5E t can be chosen so as to have 
the same sign as nxE, on L. In every case, bk 2 takes a nonzero value. 
Thus, we conclude that (3.71) is indeed a variational expression for k 2 . 

Once the variational expression is obtained we can derive an infinite 
series of eigenfunctions conceptionally as we did before. Let us consider a set 
of all functions which are real and which satisfy the boundary conditions. 
Among them, let E,i be a function which minimizes /c 2 (E,), then, E (1 is an 
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eigenfunction with the smallest eigenvalue k x 2 ^ 0. Next, let E t2 be a 
function which minimizes k 2 ( E ( ) with the additional condition that it is 
orthogonal to E fl . Then, E (2 is another eigenfunction with kf^kf. In 
this way, adding orthogonality conditions one by one, a series of eigen- 
functions E a , E, 2 , E f 3 , . . . can be obtained. The corresponding eigenvalues 
satisfy the relation 0 ^ k 2 ^ k 2 < k 3 2 ... . Note that the E,„’s thus obtained 
satisfy (3.70) even if k 2 = k m 2 as long as n ^ m. A proof will be given in 
Appendix I to show that k 2 increases indefinitely with n,i.e., lim n _ 0O k 2 = co. 

Let us next prove the completeness of the set of eigenfunctions assuming 
the infinite growth of the eigenvalues. To do so, we first normalize all the 
eigenfunctions, i.e., 

J E 2 „ dS = 1 (3.74) 

Let f be an arbitrary function which satisfies the boundary conditions and 
has derivatives V*f and Vxf which are square-integrable over S. We 
define by 

f„=f- £ A, E,„ (3.75) 

n = 1 

where 

y)„ = Jf-E,.rfS (3.76) 

and a N 2 by 

a„ 2 = f f„ 2 dS (3.77) 

Using the orthogonality and normalization conditions for the E,„’s, (3.77) 
can be written in the form 

»„ 2 = JVl ^.E„) 2 dS = J f 2 dS - t V (3-78) 

From the definition of f N , f s ja N is orthogonal to all the E,„’s for which n is 
smaller than N, i.e., 

J* (fjv/ a «) " E f „ dS = 0 (n<N) (3.79) 

Since E lN gives the smallest value k N 2 of k 2 ( E,) under the same ortho- 
gonality and boundary conditions as for f N /a N , k 2 (f N la N ) cannot be less 
than k N 2 . Noting that l N ja N is normalized by definition, we calculate 






3.4. Eigenfunctions for Waveguides 



119 



* 2 (f nM as follows : 

fc 2 (f„K) = a ; 2 J {(V x f w ) 2 + (V-f N ) 2 } dS 

= a; 2 J {(V x f - V x I>1„E,„) 2 + (V f - ? •£ /l„E„) 2 } dS 

= a ; 2 | [(V x f) 2 +(V.f) ! -2S4(^(fxVx E,„) 

+ f . V x V x EJ - 2 £ A n {V ■ (fV • E,„) - f • VV . E,„} 

+ II A * A m * ( E *n >< V X E 'm) + E,„ • V X V X E, m } 

+ II A n A m { V . (E,„ V • E, m ) - E,„ . VV • E, m }] dS 

= a ; 2 J {(V x f) 2 + (V -f) 2 - 2 Y. A„f-k 2 E„ 

+ YL A *A m E ln -kjE m ) dS 

= a', 2 f {(V x f) 2 + (V-f) 2 } dS - — 2 f A*k „ 2 
J <^N " - 1 

where the boundary, orthogonality and normalization conditions are used. 
Noting that this has to be larger than k N 2 and that the last term is positive, 
we have 

J{(vxf) 2 +(v.f) 2 }ds »*„ 2 

which is equivalent to 

«» 2 « fc; 2 J {(V x f) 2 + (V -f) 2 } dS (3.80) 

Since the integral is finite and k N 2 increases indefinitely, a N 2 approaches 
zero with increasing N. From the definition of a N , we have 

lim f (f- £ A n E tn 
N-* oo J \ n= 1 

which shows that f can be expanded in terms of the eigenfunctions. 

Let us call a function defined over a two-dimensional domain piecewise- 
continuous when it is continuous in the domain except along a finite number 
of lines each having a finite length. Let F be a piecewise-continuous and 
square-integrable function defined in S, the cross section of the waveguide. 
The function F does not have to satisfy the boundary conditions, but it can 
I / be approximated by a function f, which satisfies the boundary conditions 
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and has square-integrable derivatives, in the sense that 



J (F - f ) 



- f) 2 dS < c/4 



(3.82) 



where e is an arbitrary small positive number. This is possible because a 
continuous function f can be constructed in such a way that f is equal to F 
outside 5(e) while both its magnitude and direction continuously change 
inside 5(e), where 5(e) indicates the small area which completely contains 
the lines of discontinuity of F. It is always possible, therefore, to satisfy 
(3.82) by making 5(e) sufficiently small. Once, (3.82) is obtained, an argu- 
ment similar to the one used in the previous section to obtain (3.46) shows 
that 



lim f (f-Y dS = 0 

N-KX.J \ n= 1 / 



and the E ( „’s are complete. In other words, F can be expanded in terms of 
the E ,’s: 



F = I A n E tl 



where 



F*E,„ dS 



(3.84) 



Let F = i A„E,„ and G = x B n E (n . If F is equal to G in the sense that 
J(F — G) 2 dS = 0, then A n = B„ for each n and vice versa. Furthermore, 



f F-GJS= £ A n B n 
J n = 1 



The proofs for these assertion are almost identical with those for the one- 
dimensional case. 

Now consider F x k. Since it satisfies the conditions for the expansion 
to be possible, we have 



F x k 



00 /* 

= I E,„ I 

n- 1 J 



F x k *E,„ dS 



After multiplying both sides by k x , a little manipulation yields: 



F= £ k x E,„ f F • k x E In dS 
11=1 J 



*s,fcts.u' • 
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This means that F can be expanded in terms of the k x E,„’s: 

F = £ B„k x E,„ (3.86) 

n = 1 

where 

B n = J F-k x E f „ dS (3.87) 

Either (3.83) or (3.86) can be used equally for the expansion of F; however, 
if n x F is small on L, (3.83) is more suitable since the convergence of 
the series is generally better. On the other hand, if n*F is small on L, since 
n«k x E,„ = — k*n x E ln = 0, k x E tn satisfies a condition similar to that for 
F and (3.86) then becomes the first choice. Although there is no general proof 
for the somewhat ambiguous statement above, the situation may become 
plausible if we consider the following extreme case. If E { , is expanded in 
terms of the E,„’s, only one term is necessary. On the other hand if the 
k x E,„’s are used, since the tangential component of each k x E,„ on L does 
not vanish, a large number of terms are necessary to resemble E tl whose 
tangential component on L is zero. 

When F is complex, the real and imaginary parts can be expanded 
separately and then added together to obtain the same formulas as (3.83) 
and (3.86). 

Each eigenfunction E f „ belongs to one of the following four groups, since 
they exhaust all possible combinations. 

I. V x E r „ = 0 , V.E,„ = 0 

II. VxE^O, V-E,„ = 0 

III. V x E,„ = 0 , V • E f „ 0 

IV. VxE f „*0, V.E w *0 

Our next task is to show the following: A complete set of eigenfunctions can 
be derived such that each function in the set belongs to any one of the first 
three groups. 

Using E,„ belonging to group IV, we can define two new functions, 

E/ = A\ x V x E tn , E; = BVV • E t „ 

where A and B are normalizing constants for E/ and E t ", respectively 
E (n can be rewritten in the form 

E„ = k- 2 {(HA) E/ - (1 IB) E;'} (3.88) 

where use is made of (3.68). Since k„ 2 # 0 from (3.69), (3.88) shows that 
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E fn can be expressed as a linear combination of E/ and E," in which neither 
E/ nor E t " vanishes. If E/ is equal to zero, we have V x E,„ = 0 from (3.88) 
contradicting the assumption that E ln belongs to group IV (V x E f " is equal 
to zero due to its form, a constant times V x V(V-E,„)). Similarly, if E," is 
equal to zero, V*E,„ becomes zero contradicting the same assumption. 
Substituting E/ into V x V x (3.68), we obtain 

V x V x E/ - k n %' = 0 

Since V*E,' is equal to zero from the definition of E,', V V • E/ can be added 
to the left-hand side without changing the value, i.e., 

V x V x E/ - VV -E/ - k„ 2 E/ = 0 (in S) 

which is exactly the same differential equation as (3.65). Next, n x E,' can 
be calculated from (3.68) as follows. 

n x E/ = An x V x V x E,„ = An x k„ 2 E, n + An x V (V • E fn ) 

The first term on the right-hand side is zero because of the boundary 
condition for E ln . Except for a constant factor, the second term can be 
rewritten in the form 

n x V (V • E,„) = n x (n {d/dn) + /(d/3/)} (V • E,„) 

= nx/{d(V.E (n )/3/} 

However, since V*E ( „ is equal to zero along L, the derivative with respect 
to / also must be equal to zero, and as a result, we have nxE/ = 0. Since 
V*E/ is always equal to zero from the definition of E/, E f ' satisfies 

nxE/=0, V • E/ = 0 (onL) 

which give the same boundary condition as (3.67). From these observations, 
we conclude that E/ is an eigenfunction. Since both E/ and E (n satisfy the 
differential equation and the boundary conditions all of which are linear, 
the linear combination of E/ and E ( „, namely E t ", must satisfy the same 
equation and boundary conditions. Thus, E/' is another eigenfunction. The 
orthogonality relation between E/ and E/' can be established as follows: 

J E/ *ET dS = AB J (V x V x E,„) • (VV • E tn ) dS 

= AB j V • (V • E,„) (V x V x E,„) dS - AB J (V • E tn ) V • V x V x E ln dS 
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The first term on the right-hand side can be converted to a contour integral 
by Gauss’s theorem, and since V*E,„ vanishes on L, it is equal to zero. 
The second term is, likewise, equal to zero because V*V x vanishes. 

Now suppose that eigenfunctions are obtained successively by adding an 
orthogonality condition each time and we find that the nth function happens 
to appear in group IV for the first time. Instead of taking E,„ itself, let us 
take E/ derived from it as the nth function and E," as the n + 1st function. 
E/ and E t " are orthogonal to E tm (m < n). For example, the proof for E/ 
to be orthogonal to E tm (m < n) is as follows. If E fm belongs to group I or III, 
V x V x E fm is equal to zero ; whereas, if E lm belongs to group II, V x V x E, m 
= k m 2 E rm . In either case 

J E lm 'E/ dS = A j E, m .VxVxE,„rfS 

= ^JvxVxE m .E,„(/S=0 

where the formula for integration by parts and the boundary conditions 
are used. The proof for E," is similar. 

The n + 2nd function can be obtained so as to minimize the variational 
expression for k 2 under the condition that this function is orthogonal to all 
the functions up to the n + 1st. In this way, without the loss of completeness, 
a set of orthogonal eigenfunctions can be derived such that each function 
belongs to any one of groups I, II, or III. Hereafter we shall assume that 
this has been done. 

The electromagnetic waves derived from the s in group I have no 
longitudinal components, since E z = 0 and H z = 0 from (3.61) and (3.59) 
when y ^ 0. These waves are called transverse electromagnetic modes or, in 
short, TEM modes. Similarly, for the waves derived from groups II and III, 
we have E z = 0 and H z = 0, respectively. Thus, transverse electric modes, or 
TE modes, are derived from group II and transverse magnetic modes, or 
TM modes, from group III. From this, one might be temped to conclude 
that all the electromagnetic fields in a waveguide with a homogeneous 
medium and perfectly conducting walls can be divided into three groups, 
TEM, TE, and TM modes. We must recall, however, that an assumption was 
made at the beginning of this analysis that the fields vary exponentially with 
z. Because of this rather stringent restriction, the above fields may not repre- 
sent the most general case. There is no guarantee that an electromagnetic 
field does not exist which is not expressible as a linear combination of the 
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waves obtained above. To obtain this guarantee, we shall proceed a little 
further in the next section. 

Let us now consider TEM modes. Since V x E,„ = 0 in S and n x E,„ = 0 
on L for E,„ from group I, two-dimensional Helmholtz’s theorem shows 
that it can be written in the form 

E (n = Vq> (3.89) 

Since V‘E (n = 0 in S and n x E,„ = 0 on L, (p satisfies 

V . \q> = 0 (in S) (3.90) 

n x \(p = 0 (on L) (3.91) 



The other two conditions, V x E In = 0 in S and V • E r „ = 0 on L are auto- 
matically satisfied. The boundary condition (3.91) is equivalent to dq>jdl = 0, 
which means that (p is a constant along each connected boundary. From 
(3.89) and (3.90), cp can be considered as a two-dimensional potential 
function and E rn as the corresponding static field. 

Suppose that L is a singly connected contour, then by inspection, <p = 
constant satisfies (3.90) and (3.91) which gives E,„ = 0. This is, of course, 
what we expected since in an empty space completely enclosed by a conductor 
wall, no static field can exist. The proof is as follows: Since (p is a constant 
on L, we have 



J V ♦ {(p \(p ) dS = J (p V(p • n dl = const J n • \q> dl 

= const J (V • \<p) dS = 0 



where Gauss’s theorem and (3.90) are used. On the other hand, the left-hand 
side can be written in the form 



J V • (q> V<p) dS ~ j (V<p • \(p + cp\ • \(p) dS = J (V<p) 2 dS 



where (3.90) is again used. Combining these two equations, we conclude that 
E ln = \(p = 0 in S. This means that in a waveguide with a singly connected 
boundary, a TEM mode cannot exist. 

The above argument does not hold if there are two or more independent 
boundaries. In this case, <p can take a different value on each boundary and 
since there are (N- 1) independent ways of assigning potentials between N 
conductors, (N-l) independent solutions belong to group I. For instance, 
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in a coaxial transmission line where N is equal to two, there is only one 
TEM mode. 

Note that since V x E rn = 0 and V -E In = 0 for E,„ from group I, Eq. (3.69) 
shows that k„ 2 - 0. On the other hand, k„ 2 # 0 for groups II and III. 

Let us now consider TE modes, i.e., group II. By definition, V x E f „ is 
not equal to zero for E,„ from group II. Let us write this nonzero V x E,„ 
in the form 

V x E,„ = kk„H zn (3.92) 

This is possible because k x V x E,„ = V(k-EJ - (k-V) E (n = 0 and the 
transverse component of V x E In always vanishes. The notation H zn is used 
since it is equal to the z-component of the magnetic field of the mode 
except for a constant factor. Substituting (3.92) into V x (3.68) and noting 
that k n ^ 0, we have 

V x V x k H zn - k 2 kH zn = 0 

Since 

V x V x k H zn = - V x k x \H zn = 

- k (V • VHJ + (k • V) \H zn = - kV • VH zn 
the above equation is equivalent to 

+ k^H,, = 0 (in S) (3.93) 

The boundary condition V-E ln = 0 on L is automatically satisfied. On the 
other hand, since n x E,„ can be written in the form 

n x E,„ = k ~ 2 n x V x V x E„, = k~ x rv x V x k H zn = k ; 1 k (n . V// zn ) 

the boundary condition nxE,„ = 0 gives 

n>\H z = 0 (on L) (3.94) 

Suppose H zn and H zm correspond to E,„ and E, m , respectively, then we have 

J dS = (k n kj * J V x E,„ ■ V x E, m dS 
= (kjk„)j E„.E,„ dS 

where the formula for integration by parts and the differential equation for 
E tm are used together with the conditions V • E, m = 0 in 5 and n x E ln = 0 on L. 
This equation shows that the H.fs obtained from the E,„’s satisfy the 
orthogonality and normalization conditions, i.e., they are orthonormal to 
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each other. If n # m, the right-hand side vanishes, and if n = m, it becomes 
unity. 

It follows from this that a set of orthonormal eigenfunctions of (3.93) 
and (3.94) can be derived from the E,„’s belonging to group II. Conversely, 
a set of the eigenfunctions belonging to group II can be derived from the 
eigenfunctions of (3.93) and (3.94) using 

- k x \H zn = k n E ln (3.95) 

where k n ± 0 is assumed. To show this, let us first consider V-E (n . Since 

V • E, b = — K 1 V . (k x V// zn ) = K 1 k • (V x \H zn ) = 0 

always V-E, n vanishes. Next, the boundary condition nx E f „ = 0 can be 
checked by the following calculation. 

n x E,„ = — k ” 1 n x k x \H Z „ = - k n 'k(n • \H zn ) = 0 

Finally, writing (3.95) in the form 

m„ = /c„ kxE,„ j 

and substituting into V (3.93), we obtain ; 

V x V x E ln - k n 2 E, n = 0 

where we have used 

•.A 

-k x V x V x E,„ = — V (k • V x E t „) + (k • V) (V x E In ) 

= V (? ♦ k x E tlf ) J 

together with the fact that both V x V x E,„ and E,„ have no k components. 

Since V-E m = 0, the above result shows that E,„ satisfies (3.68). Thus, we 
have shown that E (n derived from (3.95) belongs to group II. Furthermore, 
since > 

| E,„.E m dS = (k,kJ' J | 

= (kjk,) j H„H zm dS 

if the H :n ' s are orthonormal to each other, so are the E,„’s. The above 
discussion shows that all the E, n ’s in group II can be derived from the 
independent solutions for (3.93) and (3.94) with the condition k n 2 ^ 0 
and vice versa. In other words, there is a one-to-one correspondence between 
the E,„’s in group II and the H zn ' s for which k n 2 ± 0. It should be noted here 
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that the eigenfunctions of (3.93) and (3.94) can form a complete set of ortho- 
normal functions. The proof for the completeness is essentially the same as 
before if we use the variational expression 

f (vu,) 2 ds 

k 2 m= r (3.96) 

j> d S 

In the above one-to-one correspondence, if we take all the eigenfunctions, 
the only missing one is that for k„ 2 = 0. From (3.96), \H, = 0 if the eigen- 
value is equal to zero; therefore, H, — constant is the missing eigenfunction. 
Thus, we conclude that any piecewise-continuous square-integrable scalar 
function defined in S can be expanded in terms of the (k • V x E,„/^ n )’s 
except for the constant term. 

One might fear that the particular choice of functions in group III may 
prevent some of the E,„’s in group II from appearing in the selection of the 
complete set of eigenfunctions thus making the (k*V x E,„/A: n )’s incomplete 
even after a constant term is added. However, this kind of interference 
among different groups does not exist since every function in group III is 
automatically orthogonal to any possible function in group II. This is 
shown by the following calculation. 

J E,„ • E„ dS = - k~ >k m 2 J (VV . E,„) • (V x V x E, m ) dS 

= - K 2 k~ 2 J {V-(V-E,„) (V x V x E,J - V.E„V-¥ x V x E„} dS 
= 0 

where E,„ and E, m are assumed to belong to group III and group Irrespec- 
tively. 

Let us now turn our attention to TM modes, i.e., group III. Defining 
E zn through 

V-E ln = k n E zn (3.97) 

and substituting into (3.68), we have 

\-VE z „ + k 2 E zn = 0 (inS) (3.98) 



The boundary condition V *E,„ = 0 gives 

E zn = 0 (on L) 



(3.99) 



128 



3. WAVEGUIDES 



On the other hand, n x E fn = 0 is satisfied automatically if (3.99) holds, 
since 

n x E,„ = n x (— k ; 2 ) VV .E <lt = - k ~ ‘n x \E zn 

= - k~ l u x {n (d/dn) + l(d/dl)} E zn = 0 (3.100) 

Furthermore, from 

| dS = (k n kj~ 1 J V • E,„ V • E, m dS 
= (kjk n ) j E,„.E, m </S 

it follows that if the E (n ’s are orthonormal to each other, so are the E zn ’ s. 
In other words, from the set of eigenfunctions belonging to group III, a set 
of orthonormal eigenfunctions for (3.98) and (3.99) are derived. Conversely, 
we can derive the E,„’s belonging to group III from the s using the 
relation 

-\E zn = k„ E,„ (3.101) 

Obviously, VxE,„ = 0 is satisfied. A substitution of (3.101) into (3.98) 
gives VV • E,„ + k„ 2 E In = 0 which is equivalent to (3.68) since V x E t „ = 0. 
The boundary condition V*E tn = 0 on L is obtainable when (3.98), (3.99), 
and (3.101) are combined. The other boundary condition nxE,„ = 0 is 
obvious from (3.100). Furthermore, since 

J E,„ • E, m dS = (k.kj- 1 J • \E 2m dS 
= {kJK) J E In - E zm dS 



the E tn 's thus obtained are orthonormal to each other provided that the 
E zn ’ s are selected to be orthonormal. It follows from the above discussion 
that there is a one-to-one correspondence between the E,„’s in group III 
and the eigenfunctions for (3.98) and (3.99). These eigenfunctions can from 
a complete set of orthonormal functions. The variational expression neces- 
sary for the proof is given by 





(3.102) 
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Thus, we conclude that an arbitrary piecewise-continuous square-integrable 
scalar function defined in S can be expanded in terms of the (V*E, n /Ar n )’s. 
In the above discussion, we did not consider the possibility of k n 2 being 
equal to zero. If k„ 2 is equal to zero, \E. = 0 in S from (3.102) which means 
that E z is a constant in S. However, since E z = 0 on L, this constant must 
be zero and the constant term is not necessary to form a complete set in the 
present case. 

The above discussion used E,. However, H, can be used equally well to 
replace E,. The equation for H t corresponding to (3.65) becomes 

V x V x H, — VV • H f — k 2 H, = 0 (in S) 
the boundary conditions are 

n • H, = 0 , n x V x H, = 0 (on L) 

A variational expression for k 2 is given by 

j {(V x H,) 2 + (V • H,) 2 } dS- 2 (j) (n • H ( ) (V • H,) dl 
k 2 (H,) = l 

H, 2 dS 

It follows from this that a complete set of orthonormal eigenfunctions can 
be formed. Furthermore, it can be shown that the H,„’s can be chosen in 
such a way that each of them belongs to any one of the three groups which 
lead to TEM, TE, and TM modes. The proofs for these statements are very 
similar to those for the E,„’s and hence we do not repeat them. 

This has been a lengthy section, therefore, it may be worth summarizing 
the discussion. First, an exponential variation of the fields is assumed and 
then an equation is found which the transverse electric field must satisfy. 
This equation contains a constant which is selected so that the solutions 
satisfy appropriate boundary conditions. It is shown that there is an infinite 
number of independent solutions satisfying the boundary conditions. When 
properly chosen, they form a complete set of orthonormal functions, each 
of which belongs to one of three groups leading to TEM, TE, and TM 
modes. Let the E tn ’s indicate this set of solutions. Then, an arbitrary 
piecewise-continuous and square-integrable vector function defined over 
the waveguide cross section can be expanded in terms of the E r(t ’s as well as 
in terms of the (k x E (n )’s. Furthermore, an arbitrary piecewise-continuous 
and square-integrable scalar function defined over the waveguide cross 
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section can be expanded in terms of the (k -V x E,„/fc„)'s and a constant term 
as well as in terms of the (V • EJArJ’s where the terms for which k„ 2 =0 
are excluded. Note that V x E,„ and V*E,„ for E (n from the TM and TE 
groups, respectively, are equal to zero and do not appear in the above 
expansions. 

How to eliminate the assumption of the exponential field variation is a 
subject of the next section. 



3.5 General Theory of Waveguides 

In this section, we shall derive the most general form of an electromagnetic 
field in a straight lossless waveguide with a uniform cross section. To do so, 
we first observe that E, H, V x E, and V x H in Maxwell’s equations are all 
well behaved (i.e., piecewise-continuous and square-integrable) functions, 
each of which can be expanded in terms of an appropriate set of functions 
studied in the previous section. Then, by substituting their expanded forms 
into Maxwell’s equations, all the expansion coefficients and, hence, the 
electric and magnetic fields will be determined. Since no a priori assumption 
is required on the functional forms of the fields such as the exponential 
variation with z, this method should give all the possible solutions of Max- 
well’s equations in the waveguide. 

Considering the similarity of the boundary conditions, we expand the 
transverse component of E in terms of the E,„’s and the longitudinal compo- 
nent in terms of the (V*E, n /ft n )’s: 

E = I E„ j E • E,„ dS + l k ^ j k • E V J" dS (3.103) 

where the real and imaginary parts are expanded separately and then added 
together, and the summation is from n = 1 to n = oo. Similarly, we use the 
k x E tn ’s and the (V x E tB /£ n )’s for the expansion of H: 



H=ZkxE 1B 



H k x E ( „ dS 






The last term corresponds to the constant term which is necessary to expand 
an arbitrary scalar function in terms of the (k • V x E t Jk n )'$> as we explained 
in Section 3.4. The normalizing factor is l /S' 12 . 
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Next, noting that V x E is an H-like function, we expand it in terms of 
the k x E, n ’s and the (V x E, n /& n )’s: 



VxE = £ kxE r»J VxE * kxE 

rVxEJ VxE. 

+ ) — VxE 

L K J k„ 



+ L ~K~ J V X E ' K ^ + S 1/2 J V X E ^ ( 3 - 105 ) 

Since we are going to substitute these expanded forms into Maxwell’s 
equations and compare the expansion coefficients, let us transform each 
expansion coefficient in (3.105) into a form which can easily be compared 
with the corresponding coefficient in (3.104). To this end, we write VxE 
in the form 



V x E = jV, + k — j x (E, (z) + k £ z (z)} 

= V, x E t (z) - k x V t £ z (z) + ^-kxE,(z) (3.106) 

oz 

where E ,(z) and k£ z (z) indicate the transverse and longitudinal components 
of E emphasizing that they are functions of z in contrast to the E^’s which 
are independent. The first term on the right-hand side of (3.106) has only the 
k-component while the remaining terms have only the transverse component. 
With the help of (3.106), the first expansion coefficients in (3.105) can be 
rewritten in the form 



V x E • k x E tn dS = k x E f (z) — k x V,£ z (z) j 



k x E.„ dS 



= -jE ( (z). E f „dS-J \ t E z (z).E tn dS 

Using the relation 

V,£ z (z) • E,„ « V f • {E z (z) E tn } - £ z (z) V • E tn 

the second integral on the right-hand side can be rewritten further in the 
form of a contour integral plus a surface integral. The final result is 



V x E-k x E, n dS = — E, (z) • E,„ dS 
dz 



+ k • EV ♦ E tn dS — (h E 2 (z) n • E,„ dl (3.107) 



Similarly, we can rewrite the other coefficients as follows. 



/ : |. 
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f V x E, n . V x E fn 

VxE dS = V f x E, ( 2 ) dS 

J K K 

= k n E-EJS+OnxE^—J/ (3.108) 

J J 

k , . k fk*nxE,(z) , ; 

V x E . ^5 dS = V, x E,(z) ■ s ,, 2 dS = (j) S , , 2 ' V dl (3.109) 

For the field in a waveguide with perfect conductor walls, the last terms in 
(3.107) and (3.108) become zero because of the boundary conditions 
E z (z) = 0 and n x E,(z)=0 on L. Similarly the right-hand side of (3.109) 
vanishes. 

For the expansion of V x H, we use the E,„’s and the (k\ •E t Jk„) , s. 



V x H = £ E ln VxH-E tn dS + 



kV-E, 



V»E 

VxH-k - dS (3.110) 

k„ 



Using an expression for V x H similar to (3.106) and following the method 
employed to obtain (3.107), the coefficients in (3.110) are calculated to be 



V x H-E,„ dS = 



d c 

H-k x 

dz J 



E tn dS+ H • V x E,„ dS (3.111) 



J V x H • k dS = J H 



k x k.E,„ dS 



(3.112) 



where n x E (n = 0 and V-E,„ = 0 on L are used to eliminate contour integrals. 

Substituting (3.103) and (3.1 10) into V x H = j'weE and writing the trans- 
verse and longitudinal components separately, we have 

Z E '" { - J z J H * k X dS + J H • V X E tn dS j = j<DE £ E,„ J E - E,„ dS 

(3.113) 

V^V*E P C V*E 

U‘ky.k n E tn dS=j(oe^— f ^j k-E ~^dS (3.114) 

where (3.111) and (3.112) are used. Similarly, we obtain two equations 
from V x E = — ja>fi H : 

E k x E fn Je -E, b dS + J k -EV -E fn dsj 

= — jcofi £ k x E f)1 j* H • k x E,„ dS (3.115) 
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IHN- 



k„ E-E,„ dS 



V x E (n f V x E, n k f k ) 

~k^ dS + S m H *S^ rfS f (3 * 116) 



Equating the coefficients of the corresponding E,„ on both sides of (3.1 13), 
we have 

d r _ . r r 



H-kxE fn dS + H-VxE tn dS=j(oc E-E ln dS (3.117) 

dz 



Similarly, (3.114) gives 



f V *E 

H -k x k n E m dS = joe | k • E dS 



(3.118) 



provided that V-E fn is not equal to zero. In much the same way, (3.115) 
gives 

4~ fE-E Jn d5+ fk-EV-E ln dS = -jcofi fH-k x E tn dS (3.119) 



and finally (3.116) gives 



fc„ J E - E.„ rfS = -;^Jn- V * k — dS 


(3.120) 


[H-^dS = 0 


(3.121) 



provided that V x E,„ is not equal to zero. 

Suppose E (n belongs to group I defined in Section 3.4 then (3.117) and 
(3.1 19) together with V x E ( „ = 0 and V-E,„ = 0 give 



d f 

H • k x E fn dS = jcoe E • E tn dS 

dz J J 



— J E-E IM dS = — jcofi J H-k x E,„ dS 

Eliminating | H-k x E fn dS from these equations, a differential equation 
for J E • E t „ dS is obtained : 

~ Je • E,„ dS + afen j* E -E,„ dS = 0 
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When y 2 = 0, the corresponding solutions are 



E • E w dS — C„ + D„z 



| H • k x E ln e/S = — D„ (3.123) 

f H ■ dS = - (C„ + D n z) 

J K J°>B 

If E,„ belongs to group 111, since V x E,„ = 0, then (3.1 17) reduces to 

- j j*H -k x E tn dS =jtoe | E -E t „ dS 

An elimination of J H*k x E,„ dS and J k*EV*E f „ dS from (3.1 18), (3.119) 
and the above equation gives exactly the same equation for E,„ as (3.122). 
When y„ 2 = k„ 2 — co 2 g/i is not equal to zero, we have 



E *E._ dS = A„e~ ynZ + B„e ynZ 



where 



J HkxE ln dS = Zo l (A n e-^ - B n e y ” z ) 



k • E dS = kn (A n e- y " z - B n e y " z ) 
k. y„ 



z 0n = yJj(oe 



If yf = o, the above expressions are replaced by 



E-E tn dS= — D n 

J j™ 

k x E,„ dS = C„ + D n z 



(3.124) 



C \ • F k 

k-E - ,n dS = ^(C n + D n z) 

J K jwz 

Thus, the expansion coefficients in (3.103) and (3.104) have been determined 
for all possible cases for to ^ 0. We conclude from this that if none of the 
y„’s are equal to zero, the most general expression for the electromagnetic 
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field in the waveguide when co is not zero is given by 
E = I |e .(V- + B n e^) + k ^ (A„ e - - V*)j (3.125) 

H = Z |k x E„Z 0 -„‘ (Z,<r« + !?„<;'"=) j (3.126) 



where 



y » 2 = ^„ 2 - oj 2 bh 

Z 0n = (>«/£) 1/2 TEM 

Z 0n =jo)fily„ TE 

Z 0n = y n /ico£ ™ 



(3-127) 



(3.128) 



In (3.128), Z 0n for groups I, II, and III are indicated by TEM, TE, and TM, 
respectively, for the obvious reason explained previously. If some of the 
y„’s are equal to zero, the coefficients of the corresponding terms must be 
replaced by either (3.123) or (3.124) depending on the particular group 
concerned. 

In the previous section, it was shown that electromagnetic fields in a wave- 
guide could be divided into three groups, TEM, TE, and TM modes, 
assuming the exponential field variation with z. However, because of this 
assumption, it was not clear whether or not all possible electromagnetic 
fields in a waveguide could be expressed as a linear combination of those 
modes. In this section, the exponential field variation was not assumed 
(indeed, nonexponential field variation appeared in the particular case of 
y„ = 0), and we have demonstrated that every possible electromagnetic field 
in a waveguide is given by the above expressions and that no other functional 
forms need be considered. This is a far stronger assertion than we could 
have made before; however, without this guarantee, the discussions in 
Sections 3.7 and 3.8 would have little meaning. 

Suppose the transverse components of both E and H are given on a refer- 
ence plane perpendicular to the axis of the waveguide. Let the position of 
this reference plane be at z = 0. Then, from (3.125) and (3.126), we have 

E, (0) = I E,„ {A n + B„) , H, (0) = Z k x E,„Z 0 „' (A, - B„) 
which give simultaneous equations for A„ and B„, 

J E,(0) •£,„ dS = A„ + B h , | H,(0) • k x E„ dS = Z, V (A, - B„) 
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These equations determine A„ and B„ for each n and, hence, the electro- 
magnetic field in the waveguide uniquely. In other words, the transverse 
components of E and H at a reference plane are sufficient to specify the 
electromagnetic field in the waveguide. This is true even when some of the 
y n 's are equal to zero. The transverse components of E on two different 
reference planes at z = z y and z = z 2 are also sufficient to specify the electro- 
magnetic field provided that none of the y„(z 2 — z x )’s are equal to jmn 
where m is an integer including zero. If y„(z 2 — z x ) is equal to jmn, the 
transverse components of E cannot be specified independently on the two 
planes, and the electromagnetic field in the waveguide is not determined 
uniquely. Many other ways exist for specifying the electromagnetic field 
uniquely, apart from the two described above, but we shall not discuss 
these here. 

Let us now look at the expansion coefficients appearing in (3.125) and 
(3.126). They have exactly the same functional forms as those of voltage 
and current along a transmission line. We can, therefore, introduce a one- 
to-one correspondence between the electric and magnetic fields in a wave- 
guide and the voltage V„(z) and current I„(z) on an infinite number of trans- 
mission lines, each representing one mode in the waveguide, 

E = z|E»E„(z) + k^Z 0 „/„(z)| (3.129) 

H = Z jk x E,„7„(z) -Txl, (3.130) 
Since the transmission power in the waveguide is given by 



P = Re J kE x H*dS= Re J*E*(-k x H*) 
= Re J Z E ln l 7 . (z) ■ Z E,„I* (z) dS 



= ReZE„(z)/„*(z) 



(3.131) 



it is equal to the sum of the transmission power on each transmission line 
separately. 

For a fixed frequency co, only a finite number of y n 's can become imaginary 
since linv^Ar, 2 = oo. This means that all but a finite number of modes 
are in the cutoff region (cf. Section 3.2). Therefore, as long as we avoid 
sections of the waveguide where cutoff modes are excited, the waveguide 
can be represented by a finite number of transmission lines. The effect of the 
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cutoff modes is usually expressed by multiport networks which connect the 
transmission lines representing the propagating modes as we shall see in 
Chapter 5. 

For each propagating mode, the wavelength in the guide is given by 

where X is the free-space wavelength and X c the cutoff wavelength. The phase 
and group velocities are 

«, = t>o {i - (WT 1 1,2 . v. = »» {i - (W 2 } 1 ' 2 

where v 0 = (ep)~ 1/2 and X c for TEM modes is infinite. These are the 
same formulas as those for the rectangular TE, 0 mode. The characteristic 
impedance is real and it decreases or increases with increasing frequency in 
the case of TE modes or TM modes, respectively. For TEM modes, the 
characteristic impedance is independent of the frequency. It may be worth 
mentioning that the above choice of voltage and current are somewhat 
arbitrary. As we explained in Section 3.2, nV„(z ) and /„(z)/« can also be 
considered as the voltage and current of a transmission line whose charac- 
teristic impedance is « 2 Z 0n . 



3.6 Examples of Waveguides 

As the first example, let us again consider a rectangular waveguide. As we 
discussed in Section 3.4, no TEM modes exist in this case and, since the 
fields for TE and TM modes can all be derived from H z and E z , respectively, 
it is not necessary to deal with the vector differential equation (3.65) di- 
rectly. The scalar eigenvalue problems for H, and E z should suffice. 

For TE modes, the eigenvalue problem is given by 



d 2 H z d 2 H z 



+ k 2 H z = 0 



— - = 0 (x = 0 , x — a) 
dx 



(3.132) 



= 0 (y - 0, y = a) 



The solution can be found by inspection as follows 



H z = A cos (nnxfa) cos (mny/b) 
k 2 = ( nn/a ) 2 + (mnfb) 2 



(3.133) 
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If n — m = 0, k 2 becomes zero and H z , a constant. However, this is the 
constant term we discussed in Section 3.5, and no fields exist corresponding 
to this solution. If a> b, the smallest eigenvalue which gives nontrivial 
fields is obtained when n = 1 and m = 0. This corresponds to the TE 10 mode 
discussed in Section 3.2. 

We know that (3.133) satisfies (3.132); however, a question may arise 
whether or not the functions can form the complete set necessary for our 
discussion. The answer is yes and the proof is given below. Let / be an 
arbitrary well-behaved function defined in S (0 ^ x < a, 0 ^ y < b). Since 
cos(nnx/a) (n = 0, 1, 2,...) are the eigenfunctions of (d 2 <pfdx 2 ) + k 2 q> = 0 
with the boundary conditions dtpfdx = 0 at x = 0 and x = a, they form a 
complete set of orthogonal functions over the region from x = 0 to x = a. 
Similarly, cos(mnyjb) {m = 0, 1, 2, ...) form a complete set. From the 
completeness of cos(nnxfa) (n = 0, 1,2,...), 

f Z A n (y)cos(nnx/a)\ dx 

can be made arbitrarily small for each y with sufficiently large N, where 
the /4„(> , ) , s are the expansion coefficients. Similarly, for each A„(y), 
pb r Af-l ■) 2 

| \ A n(y)- Z B nm cos (mny I b)\ dy 
J 0 ( m=0 ) 

can be made arbitrarily small with sufficiently large M , where the B mn ' s 
are the expansion coefficients. It follows from this that, for a given e > 0, 
the relation 

m JY— 1 M - I ■) 2 

X X B nm cos(nr.xla)cos(mnx/b)\ dxdy 

n = 0 m=0 j 

m N- 1 N— 1 

f{x,y)~ Z A a (y)cos(nnxja)+ X A n (y) cos (nnxfa) 

n=0 n=0 



~ Z Z B nm cos(nnx/a)sin(mnylb)\ dxdy 

n—0m-0 ) 

< 2 JJ {/(*>)')- lAMcosinnxIa)} dxdy 

+ 2 Z a f \ A n{y)~ Z B*mCOs(mnyfb)\ d\ 



can be satisfied with sufficiently large N and M, where Lemma 2 in Appendix I 
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is used, and the integration of the second term with respect to x is performed 
utilizing the orthogonal relations between cosine functions on the right-hand 
side of the first inequality (a/2 for nonzero n is replaced by a). However, 
since e is arbitrary, the above relation shows that cos(nnx/a) cos(mny/b) 
(n,m = 0,1,2,...) form a complete set of orthogonal functions. This 
completes the proof. 

For TM modes, the eigenvalue problem is given by 



d 2 E z d 2 E z 
~dx 2+ ~dp 



+ k 2 E z = 0 



E z = 0 (x = 0 , x = a. 



(3.134) 

y = 0, y = b) 



and the solutions are 



E z = A sin (nnxfa) sin (mny/b), k 2 = ( nitja ) 2 + ( mnjb ) 2 (3.135) 



If either n or m becomes zero, E z — 0 and no corresponding fields exist. 
Therefore, n — in = 1 gives the smallest eigenvalue in TM modes. Since this 
is larger than the eigenvalue for the TE 10 mode, we conclude that among all 
possible waves the TE, 0 mode has the smallest eigenvalue and, hence, the 
lowest cutoff - frequency, provided that a > b. Consequently, there is a 
frequency range in which all modes except the TE 10 are in the cutoff region 
as we stated in Section 3.2 without proof. 

At least, two modes, one TE and the other TM, share one eigenvalue 
k 2 when n and m are equal to or larger than 1 ; in this case, these modes are 
said to be degenerate to each other. When n modes share the same k 2 , they 
are n-fold degenerate. In a square waveguide of side length a, four TE and 
two TM modes share the eigenvalue 25n 2 ja 2 and, the degeneracy is sixfold. 

Let us next consider a circular waveguide with radius a. Again, no TEM 
modes exist. For the TE modes, referring to (2.62) we have 



1 d 2 H ’ 

+ ?w +kH -=°- 



where cylindrical coordinates are used to simplify the boundary condition. 
A method called the separation of variables is often found useful in the 
solution of equations like (3.136). We shall, therefore, assume the functional 
form of the solutions to be the product of a function of r alone and that of 9 
alone, i.e.. 



H, = u(r)u(0) 



(3.137) 
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Substituting this into (3.136) and multiplying the result by r 2 juv, we have 



r d du , . 

~T r ~T + k r ' 

u dr dr 



A 1 d 2 v 
) + ~vW 2 = ° 



The quantity inside the bracket is a function of r alone while the remaining 
term is a function of 6 alone. The equation implies that the first function 
is equal to the negative of the second function. However, a function of r 
cannot be equal to a function of 9 , unless they are both constant. Thus, we 
have 

1 d 2 v _ rdf du\ , , , 

= ulr VTr ) + kr =K (3 - 138 > 



The first equation is equivalent to 



d 2 v . 
M> +nV =° 



(3.139) 



and the solution is given by 



v = A cos nO + B sin n9 

Since we return to the original position when 9 is increased by 2n, the value 
of H z and hence that of v has also returned to the original value. This requires 
n to be an integer or zero. The second equation in (3.138) is equivalent to 



d 2 u 1 du ( . 
d? + rJ r + [ k 



(3.140) 



which is obviously a linear second-order ordinary differential equation 
having two independent solutions. Just as two particular independent 
solutions of (3.139) are called cosine and sine functions and are indicated by 
cos nO and sin nQ, two particular independent solutions of (3.140) are called 
Bessel and Neumann functions and are indicated by J„(kr) and N n (kr), 
respectively. Figure 3.12 shows J n (kr) and N„(kr ) versus kr for n = 0, 1, 
and 2. 

Since N n (kr ) approaches minus infinity as r tends to zero, H z becomes 
discontinuous in S(r ^ a) and is not acceptable as an eigenfunction. There- 
fore, we are left with J„(kr) and the corresponding eigenfunction becomes 



H z = (A cos nO + B sin nO) J„(kr) 



(3.141) 




142 



3. WAVEGUIDES 








3.6. Examples of Waveguides 



143 



is given by 

E z = (A cos n9 + B sinnfl) J„(kr) (3.145) 

To satisfy the boundary condition, the eigenvalue k 2 must be such that 

J n {ka) = 0 (3.146) 

Indicating the wth zero of J„(x) by x = U nm . k 2 is therefore given by 

fc 2 - ( U H Ja f (3.147) 

We have obtained an infinite number of possible solutions for both TE 
and TM modes. Again, however, we have to recall that an assumption was 
made as to the functional form of the solutions in the beginning of the 
analysis when we separated variables. It is, therefore, necessary to show the 
completeness of each set of solutions given by (3.141) and (3.145). To this 
end, let us first consider the orthogonality between the Bessel functions. 
Since U' np and W ng (p ^ q) give different eigenvalues for (3.136), the corre- 
sponding H z s must be orthogonal to each other. For instance, we have 

pa p2n 

cos 2 nQJ„(V' np rja) J n (U^r/a) r d0 dr = 0 (p * q) 

J o J o 

The integral with respect to 6 does not vanish and, hence, we obtain 

J o Jn(V; p rla) J n {U' q r/a)r dr = 0 (p # q) (3.148) 

Similarly, from the orthogonality between the Ef s, we have 



J n (Un P r la) J n (U nq rla)rdr = 0 (p ¥= q) (3.149) 

o 



These are the relations corresponding to (3.33) and are called the (weighted) 
orthogonality relations between the Bessel functions with r being the weight 
function. 

When p = q = m, the left-hand sides of (3.148) and (3.149) are no longer 
equal to zero. Their values can be calculated as follows. Writing (3.140) in 
the form 




u =0 



and multiplying by r(duldr), we integrate the result with respect to r from 
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r = 0 to a. Integrating by parts, this leads to 

2 J“ k 2 u 2 r dr = [(r duldrfj 0 + [( k 2 r 2 - n 2 ) u 2 f 0 

Substituting u = J n (U^ m r/a), k 2 = (U' nm { a f and noting that J n ( 0) = 0 for 
nonzero n and r dujdr = 0 at r = 0 and r = a, we obtain 

f J 2 ( u; m rla ) r dr = ia 2 U;~ 2 (l£ - n 2 ) J, 2 (U„' m ) (3.150) 

Similarly, substituting u = J„(U nm r/a), k 2 = ( U n Ja ) 2 and noting that u = 0 
at r = 0 and r = a when n ^ 0, and rw = 0 at these points when n = 0, we 
have 

J“ J 2 {U nm rjd)r dr = }a 2 {J„'(t/„„)} 2 (3.151) 

where J n '(U nm ) indicates the value of the derivative of the Bessel function 
J n {x) at x = V nm . 

We are now in a position to discuss the completeness of the eigen- 
functions. For example, (3.141) with k 2 given by (3.143) can form a com- 
plete set of orthogonal functions if the relation 

/•a *2 it f N M 

lim \f(r, 6)- £ £ cos nG 

M,W-»odJoJo ( n = 0 m= 1 

+ B„ m sin n6) J n {V' mn r{a )j r d6 dr = 0 

holds for any well-behaved function /defined over S (r ^ a) with the properly 
chosen ^ nm ’s and B nm y s. To prove that this is the case, we first observe that 
cos n9 and sin n0{n = 0, 1,2,...) form a complete set of orthogonal functions 
over the range from 6 = 0 to In as the eigenfunctions of (3.139) under the 
boundary conditions y(0) = y(27r) and v' (0) = v' (2n) (see Problem 3.10). 
Comparing the present problem with the proof of the completeness of the 
functions given in (3.133), it suffices to show that, for a given e>0, the 
relation * 

M ) 2 

9 (r)~ X C ttm J n {V' nm rla)\ r dr < e f 

can be satisfied with sufficiently large M for every n, where g (r) is a piecewise- 
continuous function with the integral Jo g 2 {r ) r dr being finite. However, this 
can be proved by the familiar technique using the following three relations: 
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the orthogonality relation (3.148), the variational expression for k 2 of 

(3.140) under the boundary conditions r (dujdr ) = 0 at r = 0 and r — a, 

f {r(dujdr) 2 + (n 2 jr) m 2 } dr 

= l 

J ru 2 dr 

and the relation lim m ^ 00 k 2 m = co where k 2 ,„ is the wth eigenvalue of 

(3.140) . The infinite growth of k 2 m is obvious, since if k 2 nm remained finite with 
increasing m, the eigenvalues of (3.136) should remain finite, but this is not 
the case. Similarly, the variational expression for k 2 applicable in (3.140) 
under the boundary condition ru = 0 at r = 0 and r = a becomes 



k 2 = '~ 



J [r (dujdr) 2 + ( n 2 jr ) u 2 } dr - [2 ru(dujdr)] a 0 
J ru 2 dr 



and the functions given by (3.145) with k 2 satisfying (3.147) form a complete 
set of orthogonal functions. 

Table 3.1 shows the values of U nm and U' nm for the first three «’s and m’s. 
The fields corresponding to U„ m and U' nm are called the TM nm and TE nm 
modes, respectively. The table shows that the TEn mode has the smallest 
eigenvalue and, hence, the lowest cutoff frequency. There are two independent 
TEj x modes corresponding to the cos 8 and sin 0 terms in (3.141); thus, the 
degeneracy is twofold. The TE 01 and TM 1X modes share the same eigen- 
value, and since there are two independent TM U modes, the degeneracy is 
threefold. 

Let us now consider the TE, t modes. Since the 90° rotation of one field 
pattern around the z axis gives the other pattern, we have only to investigate 



TABLE 3.1 

Values of U n m and U'nm 







Unm 






U'nm 




n 


m - 1 


m = 2 


m = 3 


m = 1 


m = 2 


m = 3 


0 


2.40 


5.52 


8.65 


3.83 


7.02 


10.17 


I 


3.83 


7.02 


10.17 


1.84 


5.33 


8.54 


2 


5.14 


•8.42 


11.62 


3.05 


6.70 


9.97 
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the term with, say, the cosine variation in (3.141). The coefficient is usually 
determined so as to ensure the normalization condition of E (1 derived 
through (3.95). However, since the normalization of H zi guarantees that of 
E,j, let us find the normalization constant for H tl . Writing the normalized 
function H :1 in the form 

H zY = K cos 6J t ( kr ) 

the value of K can be determined from the condition 



| H zl dS = J J K 2 cos 2 OJ 2 (kr) r dO dr = 1 



where 



k = (U' ll la) 



The above integral is readily performed using (3.150), and we obtain 



Using the expression for gradient in cylindrical coordinates given by (2.61), 
an explicit form for E tl can be derived. Substituting the result into (3.125) 
and (3. 1 26), all the components of the electric and magnetic fields are obtained. 

E r = K (kr)~ 1 sin 9J X (kr) (Ae~ jf,z + Be** z ) 

E e = K cos 0J x '(kr) (Ae~ jfiz + Be 1 '*) 

E z = 0 

H r -~K cos 9J X ' (kr) Zq 1 (Ae Jfiz - Be* 2 ) 

H e = K (kr)~ 1 sin 0J X (kr) Z' Q 1 (Ae’*» - Be 10 *) 

H z = Kk(jcop ) _1 cos 6J X (kr) (Ae~ jPz + Be iPz ) 

where A and B are constants, and 

0 = {(o 2 e[i - (UlJa ) 2 } 112 , Z 0 = ((Dftlp) 

Figures 3.13(a) and (b) show the electric and magnetic field patterns, 
respectively, corresponding to the A terms only, i.e., those of the wave 
traveling in the positive z direction. 

One method which will excite the circular TEj x mode is shown in Fig. 3.14 
where a rectangular waveguide is gradually deformed into a circular wave- 
guide so that the rectangular TE 10 mode is gradually transformed into a 
circular TE 1X mode. This is a tapered mode transducer. Since circular TE n 
modes have the lowest cutoff frequency, they are relatively easy to handle, 
and because of their degeneracy, there are several interesting applications. 
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Fig. 3.14. Transformation from rectangular TEio mode to circular TEn mode. 



The rotary vane type attenuator is an example which has two tapered trans- 
f ducers to transform the circular TEj x mode into the rectangular TE 10 modes 

at either end. A thin resistive film is placed diagonally through the axis of the 
circular section, as shown in Fig. 3.15(a). By rotating the circular section 
as a whole, one can vary the angle <p of the film with respect to the incident 
electric field from the rectangular section. The wave in the circular section 
| can be considered as the sum of two TE 1X modes, one having the electric 

field parallel to the film and the other normal to it. The former is attenuated 
by ohmic loss due to the induced current in the resistive film while the latter 
passes through the section without loss. Therefore, if the length of the 
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Fig. 3.15. Explanation of the principle of the rotary vane type attenuator. 



circular section is sufficiently long, the first component becomes negligible 
whereas the second component with amplitude E sin<p appears at the other 
end. This wave can in turn be considered as the sum of two modes, one 
with the amplitude E sin<p sin<p, and the other £sin<pcos<p as shown in 
Fig. 3.15(b). If the input and output rectangular waveguides have the same 
orientation (not twisted to each other), the E sin q> sin (p component exists 
through the output rectangular waveguide while the other one is reflected 
due to the cutoff characteristic of the rectangular TE 0 i mode. In practice, a 
thin resistive film is inserted in each tapered transducer section so that the 
cutoff mode is absorbed rather than reflected. Thus, the output electric field 
is proportional to sin 2 <p and the power is proportional to sin 4 cp. When the 
resistive film is not ideal and has a finite thickness or is slightly misplaced, 
the E sin<p component in the circular section is attenuated slightly; however, 
this attenuation remains the same regardless of the value of q>. The attenu- 
ation in the two tapered transducer sections is also constant. Hence, these 
three attenuations can be considered as the initial insertion loss A 0 . Therefore, 
as long as the E cos <p component in the circular section is well attenuated, 
the total attenuation is given by 



A = A 0 - 40 log 10 sin <p (dB) 

This shows that the relative attenuation is determined solely by the angle q> 
and is independent of frequency. For this reason, rotary vane type attenuators 
are widely used as standards. 

In the above expalanation, the wave incident in the circular section was 
decomposed into two parts, one having substantial attenuation, and the 
other no attenuation. This is due to the fact that the removal of degeneracy 
by the resistive film takes place in such a way that each propagation constant 
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becomes stationary with respect to a small variation of the field. This pheno- 
menon will be further explained in Section 3.9 for waveguides with inhomo- 
geneous media. 



3.7 Waveguide Discontinuities 

In this section, let us consider an infinitely thin window across a waveguide 
such as the one shown in Fig. 3.16(a). The shaded area is a conducting 
diaphragm perpendicular to the waveguide axis and located at z — 0. We 
assume that only one propagating mode exists in the waveguide and use 
subscript 1 to indicate the corresponding fields. All the other modes are in 
the cutoff region. For convenience, let us take the reference planes of the two- 
port network representing the window at z = 0, coinciding with the dia- 



(O) 




Z ll ' Z I2 Z ll Z I2 



(b) 
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Z « Y 



Fig. 3.16. Waveguide window and its equivalent circuit. 



phragm position. The two-port network can be expressed by the equivalent 
circuit shown in Fig. 3.7(b). This equivalent circuit of course cannot be 
used unless a certain length of the transmission line representing the propa- 
gating mode is attached on each side as we explained in Section 3.2. Because 
of the symmetry, Z n =Z 22 and the equivalent circuit reduces to Fig. 
3.16(b). Now, suppose a wave with a unit magnitude is incident from 
z=— oo ; then, since this mode cannot satisfy the boundary condition 
imposed by the diaphragm, a number of higher modes will be generated at 
z = 0. The transverse component of the electric field is therefore expressed 
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in the forms 

E, = (e~ nz + B x e y ' z ) E tl + £' B n e y ” z E,„ (z < 0) 

E, = S C„ e - y » z E ffl (z > 0) 

where £' indicates the summation over ail possible « except 1 and no wave 
is assumed to be reflected back at z = co. E, must be continuous across the 
aperture and vanish on the diaphragm. Consequently, at z = 0 the above 
two expressions should give the same function E r (0) which is zero on the 
diaphram. From this we obtain 

1 + Bi — C t = J E,(0)*E (1 dS (3.152) 

B, = C„ = | E,(0).E„ dS (» 5= 2) (3.153) 



The first condition shows that the voltages associated with mode 1 on each 
side of the window are equal. Referring to Fig. 3.16, this means that Z n 
is equal to Z 12 since otherwise the two voltages at the references plane 
cannot be equal. Thus, the window is found to be represented by a simple 
shunt admittance Y across the transmission line corresponding to mode 1 as 
we stated in Section 3.2 without proof. Furthermore, the two conditions 
guarantee the continuation of H, across the aperture as well as its cancella- 
tion on the diaphragm. A comparison of (3.125) and (3.126) shows that the 
expansion coefficients of E, are equal to the corresponding expansion 
coefficients of H z and, hence, the continuity of E t represented by (3.152) and 
(3.153) also guarantees that of H z . To verify the cancellation of H z , let us 
expand V x E r (0) in terms of the (V x Yjk n )'s: 



V x E, 



V x E,_ 

V x E,(0).— - dS 

K n 



Using the formula for integration by parts and the boundary condition 
n x E,(0) = 0 on the waveguide wall, the above expression reduces to 

V x E f (0) = £ V x E t „ J E f (0) • E,„ dS 

Since E,(0) and hence V x E r (0) vanishes on the diaphragm, the right-hand 
side must be equal to zero. However, using (3.126), the right-hand side is 
equal to -ja>iiH z , since Z 0n =j(o^y n for the modes with nonzero V x E f „’s, 
and therefore H z vanishes on the diaphragm. 
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Next, let us consider H, which is given by 

H, = k x E^ZoV (e~ y,z - B.e™) + £' k x E,„Z„-‘ (- B n ) e"' (z < 0) 

H, = k x E tl Zo 1 1 C l e -1 ' lz + £' k x E (z > 0) 

The condition that H, is continuous across the aperture gives 

k x E n Z„; (1 - B,) - £' k x 

= k x E fl Z 01 C, + ^ k x E tn Z 0 „ C n 

Substituting (3.152) and (3.153), we have 

£E fn ZoX = 0 (in S 0 ) (3.154) 

where S 0 indicates the aperture area. This same condition guarantees the 
continuity of E z across the aperture. Referring to (3.125), E z is continuous if 
the relation 




holds in S 0 . Let F be the left-hand side of (3.154), then F is equal to zero in 
S 0 . Expanding V*F in terms of the (V*E, n /Ar n )’s and using the formula for 
integration by parts and the boundary condition V*E,„=0 on the wave- 
guide wall, we have 

V.F = £V.E (I1 |F.E,„dS 
= ZV-E,„Z 0 -„'B„ 

However, since Z 0 „ = yjjioe , for the modes with nonzero V*E In ’s, the left- 
hand side of (3.155) is found to be jo)c\ • F, and if we recall that F and hence 
V*F vanishes in S 0 , (3.155) is proved showing the continuity of E z across 
the aperture. 

Since we have seen that all the boundary conditions for the fields at the 
window are automatically satisfied if (3. 1 52), (3. 1 53) and (3. 1 54) are satisfied, 
let us now try to obtain the admittance Y, representing the window from 
these equations. Using Eqs. (3.152) and (3.153), Eq. (3.154) can be rewritten 
in the form 

Efi = E tl J E t (0) • E fl dS + E,„(Z 01 /Z 0n ) J E,(0).E fn dS (in S Q ) 

(3.156) 

This is an integral equation for E,(0); therefore, if E,(0) is determined, B y 
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can be calculated using (3.152) and the normalized shunt admittance YjY 01 
of the window will be obtained from 

l+(17*oi) = (3.157) 



where use is made of (1.31) together with the knowledge that the reflection 
B x is produced by the parallel connection of the admittance Y and the line 
admittance V 01 . However, it is relatively difficult to solve the integral 
equation (3.156). Instead of solving (3.156), let us, therefore, express YjY 0l 
in terms of E,(0) and rewrite it with the help of (3.156) in order to study 
general proporties of Y/Y 0i . From (3.152) and (3.157), Y/Y 0 i can be ex- 



pressed by 



_ 2B[ -2(jE.(0).E„dS-l) 



Y<n 1 + 



| E,(0).E„ 



(3.158) 



Also, if we multiply both sides of (3.156) by E,*(0> and integrating over S 0 , 
we have after a transposition 

(l - J E,(0)-E (1 ds) J E,*(0)-E,, dS = £' (Z 01 /Z 0 „) jj E,(0)-E„ dS 
Substituting this into (3.158), we obtain 



2E'(Zoi/ z o„) J E, (0) • E rn dS 

|e,(o)-e n ds 2 



(3.159) 



Since all the modes except the one with subscript 1 are in the cutoff region, 
the (Z 01 /Z 0 „)’s are all pure imaginary, and hence Y/Y 01 is a pure susceptance. 
Furthermore, if only one type of mode, for example, TE or TM, is excited 
by the window, a compariosn of (3.128) and (3.159) shows that Y/Y 0 i 
becomes inductive or capacitive, respectively. 

If we look at the expression (3.159) closely, we soon realize that the value 
of YjY 0i remains the same when E t (0 ) is multiplied by a constant. This 
reminds us that all the variational expressions previously discussed had a 
similar property. Let us therefore check to determine whether or not (3.159) 
is a variational expression for Y/Y 0 x . To this end, we multiply both sides of 
(3.159) by the denominator on the right-hand side and take the variation. 
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The result is given by 



(y/y 0l ) jj E,(0)-E„ dsjd E,*(0).E„ dS 

+ J E r *(0) ‘E,! dS J JE,(0)-E, 1 dsj + (AYIY 01 ) ( E,(0).E„ dS 
= 2£'(Z 0 i/Z„„) j j E,(0).E,„dS j JE,*(0).E,.dS 
+ J E,*(0)-E,„ dS J JE,(0).E,„ dsj (3.160) 



The right-hand side can be rewritten first using (3.156) and then (3.158), as 
follows, 



2 J zlE,*(0)*E, 1 dS ^1 - J E t (0).E tl ds'j 

- 2 J dE r (0) • E,i rfS ^1 - J E f *(0)-E tl ds'j 

= (TOi)|j ^E t *(0) ‘E tl dS j E f (0) *E rl dS 
+ J dE t (0) *E,j dS | E,*(0) *E n ds\ 



where use is made of the fact that YjY 0l and the (Z 01 /Z 0n )’s are all pure 
imaginary. Substituting this into (3.160), we obtain the desired result 

(^y/y 01 )|J e,(o)-e,! ds 2 = o 

This shows that if a trial function slightly different from E f (0) is substituted 
in (3.159), it gives an approximation at least one order higher for Y/Y 0l than 
that of the trial function. Conversely, suppose that d F/V 0l is equal to zero 
for all possible variations dE t (0) from E,(0). Although this E t (0) may not 
satisfy (3.156), it is always possible to choose a constant K such that /CE f (0) 
becomes a solution of (3.156). This will be shown as follows. If AY/Y 0l is 
equal to zero, (3.160) becomes 

ReJ" j(E/I'oi)E,, |e,(0).E (1 dS 

- 2 J' (Z 01 /Z 0n ) E„ J E, (0) • E,„ rfsj JE,* (0) dS = 0 
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However, since dE,*(0) is arbitrary over the aperture, the terms inside the 
brackets must equal zero in S 0 . Equivalently, we have 

HYlYoi)*t i f (0) • E, i dS = X'(Z 01 /Z 0 „) E,„ f E r (0)-E f „dS (in S 0 ) 

(3.161) 

which is linear with respect to E,(0). Let K be a constant satisfying 

jqi + MWi)} [e,(o)-e„ ds = l 

Replacing E r (0) by AE,(0) everywhere in (3.156) and substituting (3.161), 
we can easily see that KE,(0) satisfy (3.156). We conclude from this that 
the solution of (3.156) is equivalent to the discovery of E f (0) which makes 
(3.159) stationary. Thus, (3.159) is a variational expression for YjY 0i in 
which E,(0) is a solution of the integral equation (3.156). At this point the 
question may arise as to whether or not the solution of (3.156) is unique. 
The answer is generally no. Nevertheless, we can assert that YjY 0l is uniquely 
determined. Suppose A and B are the two solutions of (3.156), then the 
difference (A - B) satisfies the equation 

E fl j* (A — B)*E ( , dS + Yj' EJZoi/Zoi,) J (A — B)»E,„ dS — 0 

Multiplying by (A-B)** and integrating the result over S 0 , we have 

2 If 2 

(A — B)-E fl dS +F( z oi/Zo*) jj (A-B)-E,„dS =0 

Since the first term is pure real while the second term is pure imaginary, 
the first term must be zero which implies 

J*A.E tl dS = J BE tl dS 

Consequently, the values of YjY 0l for E,(0) = A and E,(0) = B must be 
the same as we can easily see from (3.158). The field corresponding to A - B 
does not couple to the outside, and there is no way of exciting it; however, 
such fields can exist mathematically at certain discrete frequencies determined 
by the shape and size of the aperture. In practice, by deforming the aperture 
slightly, we can couple the fields to the outside and observe resonances in 
the vicinity of the discrete frequencies, the effect appearing in T/y 01 for the 
deformed window. 

Once we have established that the window can be represented by a simple 
shunt susceptance across the transmission line representing the dominant 
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mode, the value of the susceptance is best determined experimentally using 
a standing wave detector as described in Section 3.2, especially when the 
shape of the aperture is irregular. 

To illustrate how to use the variational expression (3.159), let us next 
consider a symmetrical inductive window in a rectangular waveguide, as 
shown in Fig. 3.6 where the aperture width is indicated by c. The dominant 
mode is the rectangular TE 10 mode, and it is obvious from the configuration 
of the window that only the TE n0 modes are necessary to satisfy the boundary 
condition on the diaphragm. E f „ and for these modes are given by 

E, n = i yy j2(aby 1/2 sin (nnx/a) 

z 0i iz 0n = vJvt = -J(V^) {M2«) 2 - 1 } 1/2 

Substituting into (3.159), we have 

[e,( 

J 6 

Since E f (0) has only the ^-component and it vanishes at the diaphragm, 
E f (0) = Ki y sin [n {x - |(u - c)}/c] 
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is chosen as a trial function for E,(0) over the aperture. This is exact in the 
extreme case of no diaphragm. With the trial function, Y/Y 01 becomes: 



y 01 J k cos 2 (nc/2a ) 



r (M2 cif - 1 } 1/2 

L {1 - ( tic /*) 2 } 2 

b = 3, 5, 7, ... 



cos 2 (mtcj2a) 



Although this is an approximate expression for YjY 0U it is expected to be a 
good one. If a - c is small, the series converges rapidly and only the first 
few terms are necessary to obtain as much accuracy as the trial function will 
permit. As an example, |T/T 01 | is calculated using the first three terms and 
shown by the solid line in Fig. 3.17 (a = 4.755 cm, X = 6.32 cm). The white 
circles indicate the experimental results. The agreement is fairly good, 
considering the crude approximation of the trial function. 



3.8 Effect of Wall Losses 

So far we have discussed waveguides with perfectly conducting walls. In 
practice, however, waveguides have finite conductivity and elctromagnetic 
energy is converted into heat due to ohmic losses when wall currents flow. 
Consequently, attenuation of wave amplitude takes place whenever a wave 
propagates down the waveguide. If attenuation is neglected in a long 
waveguide, an erroneous conclusion may be reached; hence, a new wave- 
guide model is required which gives a better representation than the one 
with perfectly conducting walls. To obtain such a model, let us first consider 
the boundary conditions at the conducting walls. Let rectangular coordinates 
£, rj, C be such that the plane coincides with the conductor surface, and 
the positive direction of the ( axis is toward the inside of the conductor. 
The conductor surface may be curved, but we are only interested in a very 
small area of the surface which can be well represented by the £rj plane. 
Since a very thin conductor sheet can shield a high frequency electromagnetic 
field quite effectively, let us assume that the electromagnetic field attenuates 
rapidly with £, and, hence, the change of the field in both £ and q directions 
can be neglected compared to its change in the £ direction. This means that 
we can assume d/d^ = d/dtj = 0. Hence, the electromagnetic field in the 
conductor becomes that of the plane wave discussed in Section 2.3. 

Of the two solutions for the plane wave, the one with a growing exponential 
factor has to be abandoned to comply with the above assumption of attenu- 
ation with increasing £. The ratio of E ^ to //, as well as that of E n to — H ( 
therefore becomes the characteristic impedance Z w of the wall conductor. 
This fact can be expressed by 



n x E = Z W H 



( 3 . 162 ) 
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where n is the normal unit vector coinciding with the £ axis. From (2.101), 
Z w is given by , /2 

z w = ( 1 +j)(w/i w / 2<7 w ) 1/2 

where o w and are the conductivity and permeability of the wall conductor, 
respectively. 

Since the tangential components of E and H must be continuous across 
the boundary surface, as we can easily see from a discussion similar to the 
one presented in Section 3.1, (3.162) serves as the boundary condition for 
the electromagnetic field inside the waveguide. It can be rewritten in terms 
of the transverse and longitudinal components of the electric and magnetic 
fields: 

nxk£ z = Z w H, (3.163) 

nxE, = Z w kH z (3.164) 

Using (3.60) and (3.61), (3.163) becomes 

n x ky~ l V-E f = Z w {jwf)~ l (ykxE,+kx y~ l VV-E,) 

Multiplying both sides of this equation by yk x , we have 

nV • E, = - Z w (jwfi) 1 (y 2 E, + VV • E,) 

Since the right-hand side contains y which is yet to be determined, let us 
rewrite this last equation with the help of (3.65) in the form 



• E r = - Z w (my) 1 (V x V x E t - a> 2 e/iE ( ) (3.165) 

On the other hand, using (3.59), (3.164) becomes 

nxE, = - Z w (jo>n)-' V x E t (3.166) 

These are the boundary conditions which the electromagnetic field must 
satisfy in the new model of the waveguide. The problem of investigating the 
effect of wall losses is thus reduced to that of solving (3.65) under the boundary 
condictions (3.165) and (3.166), which is again an eigenvalue problem. If 
E f is obtained for a certain k 2 , the other field components can easily be 
derived from it as before. It is worth mentioning that the equations involved 
are all linear, and therefore any superposition of the electromagnetic fields 
thus obtained also constitutes a solution for the field in the waveguide. 

When Z w is small, as is usually the case, it might be expected that there 
is a solution E ta similar to E ffl , the solution of the original eigenvalue 
problem whose boundary conditions are given by (3.67). Thus, if we expand 
E, a in terms of a complete orthonormal set E,„ (n = 1,2,...) in the form 



1 C n E, 

n*a 



(3.167) 
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where 

C„ = jE„.E„dS (3.168) 

the C„’s are expected to be small and of the same order of magnitude as 
Z w . Since E fa is a solution of (3.65), it satisfies 

VxVxE, I -?V.E (I -rt, = 0 

where k x 2 is the corresponding eigenvalue. To determine k 2 as well as the 
C„’s, let us multiply the above equation by E,„- and integrate over S, the 
waveguide cross section. Using the formula for integration by parts twice, 
we have 



| (E tn • V x V x E fa - E fn • VV • E, a - k a 2 E tn • E fa ) dS 

= J E fa • (V x V x E In - VV • E,„ - k a 2 E tn ) dS 

+ J n • (E ra xVxE In -E,„xVx E, a - E, n V.E ta + E, a V-E tn ) dl 
= 0 

The first and second terms in the bracket in the second surface integral can 
be replaced by a single term k 2 E,„. The second term in the contour integral 
is equal to n x E„,‘Vx E, a and since nx E,„ is equal to zero on L, this 
term vanishes. Similarly, the fourth term vanishes because V*E t „ is equal 
to zero on L. The first and third terms are n x E, a * V x E*„ and E in *nV *E ta , 
respectively. Substituting (3.166) and (3.165) into these expressions, the 
above equation gives 

(k* - O J E„ • E„ dS = (ja>nY' J Z w {V x E„ • V x E,„ 

- (V x V x E, a ~ (o 2 £fiE ta ) ♦ E<„} dl (3.169) 

Let us assume for the moment that all the C„’s are of the same order of 
magnitude as Z w . Then, E (a in (3.169) can be replaced by E (fl , as long as 
we are interested in the approximation up to the first order of Z w . Setting 
n = a, (3.169) becomes 

k a 2 - k 2 = (ja>}iy 1 J Z w {V x E, fl • V x E tfl 



- (V x V x E, a - (o 2 eh EJ • E J dl 



from which the new eigenvalue k a 2 can be calculated. 




(3.170) 
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Since integrations similar to the one which appears on the right-hand 
side of (3.170) will be used several times later, let us introduce a symbol 
Z w (n, m) defined by 

Z w («, m) = (j(Ofi)~ 1 J Z w (V x E (fl • V x E, m 

- (V x V x E In - (o 2 £fiE tn ) • E tm } dl (3.171) 

Since we have, from the definition of E fn , 

V x V x E ln - o) 2 E/iE f „ = VV • E, n + y„ 2 E fn = jo)^y n Z 0n l E tn 

where Z 0n is the characteristic impedance of the nth mode given by (3.128), 
then it follows that Z H ,(n, m) can also be written in the form 

Z w (n, m) = J Z w {(jcon)- 1 V x E tn - V x E tm - y n Z~ 0 i E f „ . E fffl ) dl (3.172) 

Next, setting n^a in (3.169) and substituting the result into (3.168), we 
obtain C„ to the same order of approximation as before. 

c, = | E„-E„ dS = (fc„ 2 - k a 2 )-' Z w (a, n) (3.173) 

If every k n 2 (n ^ a) is sufficiently different from k a 2 , then every C n becomes 
of the same order of magnitude as Z w , as expected, and E, a is determined in 
its expanded form. However, if E fa is degenerate, there is at least one k 2 
equal to k a 2 in which case the denominator on the right-hand side of (3.173) 
becomes zero and the corresponding C„ can no longer be considered small. 
This requires a reconsideration of the assumption we made in the derivation 
(3.170) and (3.173) from (3.169). 

To facilitate the discussion of degenerate cases, we first show that 

{ Z„,(a , b)/Z 0b ] = {Z w (b, a)IZ 0a } (3.174) 

where a and b indicate the degenerate modes and hence k a 2 = k b 2 and 
y a = If one of the degenerate modes is a TM mode or if both are TEM 
modes, the first term in the integral on the right-hand side of (3.172) dis- 
appears, and it is obvious that (3.174) holds. If both modes a and b areTE, 
then Z 0a = Z 0b and (3.172) show that Z H ,(a, b) = Z w (b, a). However, using 
the relation Z 0a = Z 0b again, it is obvious that (3.174) holds. We conclude 
from these observations that the relation (3.174) holds between any two 
modes degenerate to each other. 

For the moment let us confine ourselves to the case of twofold degeneracy. 
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When mode b is degenerate to mode a, C b may become large, and the deri- 
vations of (3.170) and (3.173) become invalid since they assume that all the 
C„’s are small. To avoid this difficulty, we remove the assumption that the 
coefficient of E tb is mall in (3.167) and write E ta in the form 

E„ = AE,. + BE,„ + £ C„E,„ (3.175) 

n*n, b 

where the C„’s are expected to be small. Substituting this into (3.169), 
setting n equal to a and to b in turn, and taking terms up to the first order 
of Z w , we obtain 

(k 2 - k 2 )A = AZ w (a, a) + BZ w (b, a) 

(k 2 - k 2 ) B = AZ w (a, b ) + BZ w (b , b) 

Transposing all the terms to the left, they become 

{Z w (a, a) - ( k a 2 - k 2 )} A + Z w (b, a) B = 0 
Z w (a, b) A + { Z w (b , b) - ( K 2 - k 2 )} B = 0 

These are simultaneous equations for A and B. For nontrivial solutions to 
exist, the determinant of the coefficients must vanish. This condition gives a 
quadratic equation from which the value of (k 2 — k 2 ) can be determined. 
Substituting this into (3.176), the ratio of A and B is determined. By sub- 
stituting the first two terms on the right-hand side of (3.175) for E ta every- 
where on the right-hand side of (3.169) and combining the result with (3.168), 
the coefficient C„ can be obtained for any particular set of A and B, one of 
which is arbitrary. Since a quadratic equation generally has two different 
roots, two independent solutions E ta and E, e are determined by this proce- 
dure. In this case, E (a and E, p are no longer degenerate. As in this example, 
there are many cases in which degeneracy disappears due to a small pertur- 
bation given to the originally degenerate system, this phenomenon is called 
the removal of degeneracy (cf., discussion at the end of Section 3.6). 

In order to solve (3.176), it is not necessary to follow the above general 
procedure. Because of the relation (3.174) between the coefficients of (3.176), 
the simultaneous equations are satisfied, if 

A = Z\ 12 cos 0, B = Zji 2 sin 0 (3.177) 

or 

A = - Z X J 2 sin 0 , B = Zjj 2 cos 6 (3.1 78) 

where 9 is determined by 

, 2(Z 0 fl /Z 06 ) 1/2 Z w (a, b) 

tan 29 = 7 r — — 7 T- 

Z w (a, a) — Z w (b, b) 



(3.179) 
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This can be checked by substituting (3.177) or (3.178) into (3.176) and 
showing, with the help of (3.174), that the two equations give the same 
( k a 2 — kf). The two solutions E t£t and E tp are, therefore, given by 

E (a = Zo. 2 cos 0E (a + Z\j 2 sin 0E lb + X C E,„ (3.180) 

n*a, b 

K, = -Z£l 2 sm0E,„ + z;' 2 cos0E„,+ X C . E .„ (3-181) 

n *a, b 

The corresponding eigenvalues are obtained from the following relations 
which are the result of substituting (3.177) and (3.178) into (3.176). 

k a 2 - k 2 = Z w (a , a) + {Z 0b IZ 0a ) 1/2 Z w (b, a) tan 9 (3.182) 

k a 2 - k 2 = Z w (b, b ) - (Z 0 JZ 0b ) 1/2 Z w (a, b ) tan0 (3.183) 

Similarly, Cf and C" can be obtained from (3. 168) if E, a is replaced by the 
first two terms of E, a and E (fi , respectively. 

Let Ej®* and Efj?* be the principal parts; i.e., the first two terms, of E, a 
and E,p, respectively. The terms E, ( a ; and Ejj 0 are not necessarily orthogonal 
to each other in the sense of (3.70); however, since the magnetic field 
corresponding to E$ } is given by 



/-sin0 cos0 

H,™ = k x I + =jjj- E l( , 

\ t'fth 



we have the relation 



J E,? x H<° 



nW-kdS = 0 



The subscripts a and ^ can be interchanged without changing the result. If 
modes a and b are in the frequency range where propagation is possible, 
Zo. and Zot are both positive real, and hence the above relation can be 
rewritten in the form 

J Ei a 0) xHy*.kdS = 0 



This indicates to a first order approximation, that the two waves corre- 
sponding to E ta and E lfi carry power independently of each other. 

When the degeneracy is more than twofold, (3.175) has to be replaced by 

E(a = x4E ta + BE tb + CE tc -\ b Y, C n E tn 

where modes a, b , c, etc., are degenerate to each other and the last summa- 
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tion is over all possible n excluding a, b, c, ... . The procedure to obtain 
appropriate solutions is essentially the same as before. It must be realized, 
however, that an mth order algebraic equation has to be solved in the case 
of wi-fold degeneracy. 

Suppose that the eigenvalue k a 2 is obtained using the above procedure, 
usually called the perturbation method. Then since k 2 = y 2 + we have 

7a 2 - 7 2 = K 1 ' K 2 

from which y a can be calculated. Let Ay be the variation from y a due to the 
wall impedance Z w ; i.e., Ay = y a — y a . Then Ay is given by 

Ay = - ba l (K 2 ~ k * 2 ) 

If subscript a indicates a nondegenerate propagating mode, then y a =jfi a , 
and we obtain the following formula from (3.170) and (3.171): 

A7 = W:'Z w {a,a) (3.184) 

The real part of this expression gives the attenuation constant due to the 
lossy walls. 

As an example illustrating how Ay is calculated in practice, let us take the 
TE 10 mode in a rectangular waveguide. For this mode, E, is given by 

E, = i y j 2 {ab)~ 1 ' 2 sin(7ix/u) 

Therefore, we have 

V x E t = k(nla)yj2(aby 112 cos(nxla) 

VxVxE, = ij, (n/a) 2 yj2(ab )~ 1/2 sin(7cx/a) 

Substituting these into (3.171) and then inserting the result in (3.184), Ay 
is calculated to be 




The real part of this expression gives the attenuation constant. 
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In general, Z w is proportional to co l/2 , and for large co, fi is proportional 
to <y. Therefore, the attenuation constant due to the wall losses increases in 
a manner approximately proportional to co l/2 for large co. On the other 
hand, as co decreases toward the cutoff frequency, the attenuation constant 
again increases, since /I decreases, and at the cutoff frequency the attenu- 
ation becomes infinite. As a result, at a certain co, the attenuation constant 
takes a minimum value. For certain waves whose E, vanishes on the wall, 
the term proportional to co 2 in (3.171) disappears and the attenuation 
constant decreases indefinitely as co increases. The TE 0 „ modes in circular 
waveguides are an example. Although the circular TE 0n modes are degen- 
erate to the TM ln modes, the removal of the degeneracy takes place in such 
a way that (3.184) is directly applicable since Z w (a, b ) happens to be zero 
when a and b indicate the TE 0n and TM ln modes, respectively (the degen- 
eracy between two independent TM ln modes is not removed by uniform 
wall impedance Z H ,). 

Let P be the transmission power of a wave propagating in the positive z 
direction and P L be the power loss per unit length of the waveguide walls. 
From the conservation of energy, the decrease of P per unit distance must 
be equal to P L ; i.e., 

P L = -{dPjdz) 

Since the amplitude of the wave is proportional to exp(— az) where a is 
the attenuation constant, P is proportional to exp(— 2 az) and we have 

« = -i (dPldz)IP = i(PJP) (3.185) 

In order to obtain a from this formula, we calculate P L and P assuming that 
the transverse electric field of the wave is approximately given by AE ta , 
where A is a real constant. The wall loss per unit length is calculated from 
the integral of the Poynting vector over a unit length of the wall. Using the 
boundary condition (3.162), the loss is given by 




If E t(J belongs to TEM of TM modes, V x E fa = 0, and we have 
Pl = ~ A 1 Re(y o Z 0o ) _l Z w (a, a) 

On the other hand, if E, a belongs to a TE mode, then Z 0a = j(Ofi/y a and P L 
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becomes 



P L = - A 2 Re(jcofi) 1 Z w (a, a) 



Since P is given by the integral of the Poynting vector over the waveguide 
cross section, we have 



= Re J E x H* 



■kdS = A 2 Z; 



Substituting these expressions into (3.185), the attenuation constant is given 
by 

a = - Re {\y a l Z w (a, a)} = Re {\j$ a 'Z w {a, a)} 

regardless of the type of the mode under consideration. This result is identical 
with that obtainable from (3.184). Since the present method for calculating 
a utilizes the relation between the transmission power and wall loss, it is 
called the power-loss method. A serious drawback to the method is the 
possibility of obtaining wrong attenuation constans for degenerate modes 
since the assumption that the transverse field is approximately given by 
AE ta is wrong in most cases. When E tfl is degenerate, as we discussed before, 
there is no easy way of finding appropriate field configurations by the power- 
loss method in contrast to the perturbation method previously employed. 

Before closing this section, let us briefly discuss the effect of lossy media. 
When the conductivity a of the medium inside the waveguide is finite, j'coeE 
on the right-hand side of (3.52) is replaced by (<j -I- jcos) E. No other modifi- 
cation is necessary, and all the results obtained from (3.52) and (3.53) are 
valid if £ is everywhere replaced by s + (a I ja>). Thus, the relation between 
k a 2 and the propagation constant y is given by 



k 2 =y 2 + co 2 {e + {ojjaj)} ii 



(3.186) 



from which the effect of the conductivity on the propagation constant can 
be calculated. Assuming that a is small, and writing y = y a + Ay, Ay becomes 

Ay ^ i/yjW/* = 'coan . 

Therefore, when a is taken into account, the increment in y is real and 
positive for a propagating mode, which means that the attenuation constant 
increases but the phase constant stays the same. 



3.9 Waveguides with Inhomogeneous Media 



In this section, we consider the case in which e and n are functions of 
transverse position but are independent of z. Maxwell’s equations are given 
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by (3.52) and (3.53) and, hence, (3.57)-(3.60) are all valid without modifi- 
cation. On the other hand, the divergences of (3.52) and (3.53) give V *eE = 0 
and V */iH = 0, respectively, from which we obtain 

V • eE, = yeE z , V • = y^tH, (3.187) 



in place of (3.61) and (3.62). We can eliminate H„ E z , and H z from Eqs 
(3.57)-(3.60) and (3.187) leaving a differential equation involving E, alone. 

fi\ x /T 1 V x E, - Ve _1 V*eE, -(o>V + y 2 ) E, = 0 (in S) (3.188) 

If both e and /i are constant in the waveguide cross section S , this equation 
obviously reduces to (3.65). From the requirement that the tangential 
component of electric field must equal zero on the waveguide walls, the 
boundary conditions are given by 

n x E, = 0 , V • eE, = 0 (onL) (3.189) 

Thus, we have an eigenvalue problem, namely, the differential equation 
(3.188) with the boundary conditions (3.189) and a constant y 2 to be deter- 
mined. Since e and /i are not constant over the waveguide cross section, 
(d 2 eh + y 2 can no longer be considered as a constant to be determined. 
Once the solutions of the eigenvalue problem are obtained, all the other 
components of the electric and magnetic fields can be calculated using Eqs. 
(3.57)-(3.60) and (3.187) as in the case with a uniform medium. 

Let E rm and E t „ be two different eigenfunctions and y m 2 and y„ 2 be the 
corresponding eigenvalues. Since E tm is a solution of (3.188), we have 

fi\ x n~ l \ xE lm ~ WT 1 V •eE tm - (a) 2 Efi + y m 2 ) E Im = 0 

Multiplying by (V x V x E,„ - o) 2 £E tn )* and integrating the result over 
S, we obtain 



J fi(\ x /T 1 V x E, m - tu 2 £E lm ) * (V x n ~ J V x E tn - w 2 £E,„) dS 

- J <u 2 £ - 1 (V • fiE rm ) (V • £E, n ) d S 

~ y m 2 | J 1 (V x EJ . (V x E,„) dS - J 0 > 2 eE lm • E rn dsj == 0 



(3.190) 
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where use is made of (3.189). Interchanging the subscripts m and n and 
subtracting the result from (3.190), we have 

bn 1 ~ yj) || /i ' ' : 1 (V X E J • (V x E,„) dS - j cohE, m ■ E„ <is| = 0 

This shows that, between two eigenfunctions with different eigenvalues, 
there is an orthogonality relation expressed by 

J II-' (V x Ej.(v x EJ dS - J < 0 2 eE, m .E,„ dS = 0 (3.191) 

Let H t „ be the magnetic field corresponding to E,„. From Eqs. (3.57)-(3.60), 
k x H tn is expressed in terms of E fn as follows : 

k x H tn = - (jcoy n )~ 1 (V x/i-'VxE,- a> 2 sE tn ) (3.192) 

Therefore, the above orthogonality relation is equivalent to 

J k • E lm x H,„ dS = 0 (3.193) 

When the orthogonality relation (3.191) holds, we can derive another 
orthogonality relation from (3.190) given by 

| H(V x /T 1 V x E fw - ct) 2 fiE fm ) • (V x /T 1 V x E tn - m 2 sE tn ) dS 

- j (o 2 e~ 1 (V • fiE, m ) (V • £E,„) dS = 0 (3.194) 

Using (3.57), (3.187), and (3.192), this can be rewritten in the form 

J s' 1 ( V x H, J • (V x H J dS - J o o 2 fM, m ■ H,„ dS = 0 (3.195) 

which is the dual of (3.191). 

For waveguides with homogeneous media, we found the simple ortho- 
gonality relation given by (3.70). In general, however, it is impossible to 
find a corresponding simple orthogonality relation when the medium is 
inhomogeneous; i.e., 

Je„-e„#o, J £ e„.e„#o 
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A variational expression for y 2 is given by 



y 2 (Ei) = - 



J ,u(V y. 11 1 V x E, - co 2 s E t ) 2 dS - J co 2 e 1 (V ‘sE,) 2 dS 

fM-'PxW-jWdS 



In the derivation, an assumption is made that the denominator does not 
vanish. Although it is likely that the eigenfunctions of (3.188) and (3.189) 
form a complete set of orthonormal functions, a procedure similar to the 
one employed in Section 3.4 fails to prove the completeness when use is made 
of (3.196). To the author’s knowledge, the completeness has not yet been 
proved. However, since (3.196) is a variational expression, it can be used 
advantageously to obtain approximate values for the y 2 's. 

If the waveguide medium is homogeneous, we could classify the fields 
into three groups: TEM, TE, and TM modes. This kind of classification, 
however, is not possible if the medium is inhomogeneous. For almost all 
modes, both E z and H z are finite. 

It is a straightforward process to extend the discussion to include the effects 
of finite conductivity in the medium, we have only to replace e by e + {ajjm) 
everywhere in the above discussion. In the rotary vane attenuator discussed 
in Section 3.6, the degeneracy of two TE n modes is removed in such a way 
that the expression (3.196) with e replaced by e + (a/jw) becomes stationary 
for each independent mode. The modes with the electric fields perpendicular 
and parallel to the resistive film give the smallest and the largest attenuations, 
respectively, and, hence, they are independent of each other. When the 
film resistivity is small, the field configuration for the second mode will be 
considerably distorted by the presence of the film. Nevertheless, the dis- 
cussion for the total attenuation of the attenuator remains unchanged. 



PROBLEMS 

3.1 Using the variational expression given by (3.34) and a trial function sin (nxfa), 
calculate the first eigenvalue of the following eigenvalue problem. 

(d^Ey/dx 2 ) + k 2 E y = 0 
E y = 0 at x = 0 and x — a — d 

where d is assumed to be small. Compare the result with the exact value k 2 = re 2 / 
(a - d ) 2 . 

3.2 Prove that (3.96) is a variational expression for the eigenvalue problem defined by 
(3.93) and (3.94). 
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3.3 Using the variational expression (3.96), calculate the eigenvalue of the TEio-like 
mode in a waveguide whose cross section is a rectangle with the four comers removed 
as shown in Fig. 3.18. 




Fig. 3.18. Deformed cross section of a rectangular waveguide. 

3.4 Calculate the electric and magnetic fields of the dominant mode in a semicircular 
waveguide. 

3.5 The rectangular TEn and TMn modes are degenerate to each other. Following the 
discussion in Section 3.8, calculate how the removal of the degeneracy takes place 
when the wall losses are taken into account. 

3.6 Suppose the inner surface of a circular waveguide with radius a is coated by dielectric 
material ( e = £ 0 e s ) of thickness d (5 <g a). Discuss the effect of the coating on the 
TEoi and TMn modes by calculating how the degeneracy is removed. 

3.7 Suppose two rectangular waveguides are placed side by side and two holes are drilled 
through the common wall a quarter wavelengths apart as shown in Fig. 3.19. Part 
of the power coming in from port I will leak out from port IV, but none will emerge 





Fig. 3.19. Directional coupler. 
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from port III since the waves coming through the two holes cancel each other due to 
the half wavelength path difference. Similarly, if a wave is incident on port II, some 
power leaks out from port III, but none emerges from port IV. Because of this 
directional property, the device is called the directional coupler. If three holes are 
drilled, each a quarter wavelength apart, and the amount of leakage through each 
hole has the voltage ratio 1:2:1, then the directional property is obtained over a 
wider frequency bandwidth than in the case with only two holes. Using a complex 
vector diagram with three vectors, each representing the magnitude and phase of the 
leakage through each hole, explain why the bandwidth becomes wider. 

3.8 Cross one rectangular waveguide on top of another and drill two cross-shaped holes 
through the common wall as shown in Fig. 3.20. Then, part of the power coming in 



n 




i 



Fig. 3.20. Cross-guide directional coupler. 

from port I leaks out from port IV but none from port III, giving the directional 
coupler action. Explain the principle of operation qualitatively, considering the field 
configuration of the rectangular TEio mode. This is inherently a broadband device 
since the cancellation does not utilize the path length difference. 

3.9 Prove that y 2 is real for the waveguides with inhomogeneous media when a = 0. 
The Ejn’s can be chosen to be real, and (3.193) is equivalent to 

J k-E-m X H* dS = 0 

for the propagating modes m and «, which indicates that each wave carries its own 
independent power. 

3.10 Prove that a variational expression for the eigenvalue problem (d 2 E y /dx 2 ) + k 2 E y = 0 
under the boundary conditions E y ( 0) = E ¥ (a) and {dE y (0)/dx} = {dE y (a)/dx} is 
given by 

k*{E y ) = 

J {dEyjdx) 2 dx - [Ey dEy/dx] 0 a + Ey( 0) {dEy(c,)/dx} ~ Ey(a) {dEy(0)/d X } 

j Ey 2 dx 




CHAPTER 4 



RESONANT CAVITIES 



Resonant cavities are devices constructed so that one can utilize resonant 
phenomena of electromagnetic fields in a space enclosed by good conducting 
walls. They are useful as circuit elements, particularly as microwave counter- 
parts of LC resonant circuits in low frequency ranges. Furthermore, their 
analysis offers an opportunity to demonstrate a powerful eigenfunction 
approach. For these reasons, this whole chapter is devoted to the theory 
of resonant cavities. An equivalent circuit of the cavities is obtained and a 
method to experimentally determine the circuit parameters is studied in 
detail. In addition, a discussion of cavities with inhomogeneous media is | 

briefly presented. In contrast to waveguides with inhomogeneous media, the 
completeness of the eigenfunctions can be shown without difficulty. 



4.1 Introduction 

Conductor walls forming a resonant cavity generally have finite con- 
ductives and, hence, introduce losses of electromagnetic energy. Further, in 
order to utilize the resonances in the cavity, there must be at least one opening 
through which the inside and outside of the cavity are connected. However, 
as an idealized model, let us first consider a space completely enclosed by a 
perfect conductor and study how electromagnetic fields behave in it. Max- 
well’s equations are given by 



VxH= jcoE E 





170 



(4.1) 

(4.2) 
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Substituting V x (4.2) into (4.1), we have an equation in E alone; 

V x V x E - a> 2 epi E = 0 (in V) (4.3) 

where an assumption is made that e and n are constant in the volume V of 
the cavity. The boundary condition for E is given by 

n x E = 0 (on S) (4.4) 

where n is the unit vector directed outwards normal to the wall surface S. 
Equations (4.3) and (4.4) constitute another eigenvalue problem whose 
solutions can exist only when m 2 e^ = k 2 takes certain discrete values. For 
instance, for a rectangular enclosure with sides a, b and c, the eigenvalues 
are given by 

k a 2 = = ( nnja ) 2 + {mnjb) 2 + {Injcf (4.5) 

where n, m, and / are arbitrary integers including zero, provided two of 
them do not become zero simultaneously. This type of cavity can be con- 
sidered to be a rectangular waveguide which is an integer multiple of a half 
wavelength long and short circuited at both ends. There are two different 
field configurations for each set of n, m, and /, one corresponding to a TE 
mode, and the other to a TM mode. Accordingly, the field configurations 
are designated as TE nm/ and TM„ ml modes. There is one exception; no TE 
mode exists for / = 0. 

For a cylindrical cavity of radius a and length L, corresponding to TE 
modes in a cylindrical waveguide, there are TE nmi modes (/^0) whose 
eigenvalues are given by 

k, 2 = o> 2 W = (KJa) 2 + (InjL) 2 (4.6) 

Similarly, corresponding to TM modes of the waveguide, there are TM nm , 
modes with the eigenvalues given by 

k a 2 = oj a 2 efi = ( UJaf + (/tt/L) 2 (4.7) 

where / is allowed to be equal to zero. The resonant frequencies for /=0 
are exactly the cutoff frequencies of the corresponding modes in the wave- 
guide. 

As illustrated above, if a finite electromagnetic field exists in a space 
completely enclosed by a perfect conductor, the field should have at least 
one of the discrete frequencies determined by to a = k a (en)~ 1/2 where the 
k a ' s are the eigenvalues of (4.3) and (4.4). 

In order to excite, detect, or utilize the electromagnetic field in the cavity, 
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let us next consider two small openings as shown in Fig. 4.1, which connect 
the inside of the cavity to waveguide ports I and II. Suppose a wave with 
frequency co is incident on port I, and co is equal to one of the discrete 
frequencies mentioned above, then the electromagnetic field will be strongly 
excited in the cavity and a substantial amount of power will leak out to 
port II. On the other hand, if co is different from any one of the co^’s, port I 
and port II will be isolated by the cavity since it cannot contain a field with 





U ) ! CO 2 (lJ 3 (O 

Fig. 4.2. Cavity output power versus co. 



co different from the co a 's. This, however, is true only for the idealized cavity 
with no opening. Because of the opening, if the difference between to and co a 
is small, a weak electromagnetic field will be excited in the cavity and some 
power will leak out of port II. As a result, when co is varied keeping the 
amplitude of the wave incident on port I constant, the power out of port II 
may look like Fig. 4.2. At <o lt co 2 , co 3 , and so on, determined by the geometry 
of the cavity, the output power becomes large and as co moves away from 
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these discrete frequencies, it drops rapidly. Each of the bell shaped curves is 
called a resonant curve. The heights of the peaks are different from one 
another because different modes are excited differently. When the cavity is 
close to the idealized case, the output power drops quickly as co changes 
from each co a , and the width of the resonant curve becomes narrow. On the 
other hand, if the resonant curve is broad, it shows that the cavity behaves 
quite differently from the idealized case for the particular mode. Thus, the 
width of the resonant curve gives some indication about the quality of the 
cavity and how close it is to being ideal. The resonant curve is sometimes 
called the 0-curve after the first letter of the word “quality.” The center 
frequency of the curve is equal to o) a (for a more precise discussion, see 
Sections 4.2 and 4.3), a property which can be used to measure frequencies 
when we change the length of the cavity to make the output power maximum. 
If either the approximate frequency is known beforehand or the cavity is 
constructed so that only one mode can be excited easily, the resonant mode 
can be identified, and the frequency c o a can be calculated from the dimen- 
sions of the cavity. Cylindrical cavities are widely used for this purpose 
since it is relatively easy to build them precisely using a lathe. The modes 
usually employed for frequency measurements are circular TE 01i or TE U1 
where / is a small integer such as 1, 2, or 3. 

4.2 Expansions of Electromagnetic Fields 

A complete set of orthonormal functions cannot be derived from the 
eigenvalue problem defined by (4.3) and (4.4); nevertheless, the two- 
dimensional problem studied in the previous chapter suggests that the solu- 
tions of the eigenvalue problem 

V x V x E - ?V -E - k 2 E = 0 (in V) (4.8) 

n x E = 0 , V *E = 0 (onS) (4.9) 

may give a desired set; in fact, this eigenvalue problem can be discussed in 
exactly the same manner as before. Thus, the eigenvalues are real and non- 
negative (positive or zero), and the real eigenfunctions can be chosen to 
form a complete set. The variational expression for k 2 is given by 

I {(V x E) 2 + (V *E) 2 } dv-2 (nxE-VxErfS 

k 2 (E) = J 

J E ^ 



(4.10) 
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where the volume integral and surface integral are over V and S, respectively. 
The orthogonality and normalization conditions are expressed by 



| E Jt -E w dv = 0 (n*m) 

J E n ♦ E m t/y = 1 (n = m) 



Furthermore, each function in the complete set can be chosen so as to belong 
to one of the following three groups. 

I. VxE„ = 0, V-E a = 0 

II. VxE„^0, V • E„ = 0 

III. VxE„ = 0, V • E„ ^ 0 

In terms of the eigenfunctions thus obtained, an arbitrary vector function 
F can be expanded as follows, provided it is piecewise-continuous (i.e., 
continuous except on a finite number of surfaces each with a finite area) and 
square-integrable over V: 



where 



F = E A E » 



U = /f.e„ 



The real meaning of the equality in (4.11) is given by 
lim (Yf- £ A n E^j dv = 0 

JV-oo J \ n = 1 / 



(4.11) 



(4.12) 



(4.13) 



When F is a complex vector function, the real and imaginary parts can be 
expanded separately and then added to obtain the formulas identical to 
(4.11) and (4.12). No proofs will be given here for the above assertions 
since they are obvious from the previous discussion for the two-dimensional 
case. A brief discussion concerning the proof of the relation lim,,.,,*, k n z — co 
will, however, be given in Appendix I. 

Following a procedure similar to the eigenvalue problem for H t in the 
previous chapter, we have 

VxVxH — VV*H — k 2 H — 0 (in V) (4.14) 

n *H = 0, n x V xH = 0 (on S) (4.15) 



n *H = 0, nxVxH = 0 
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A variational expression for k 2 is given by 

f {(V x H) 2 + (V-H) 2 } dv- 2 f (n *H)(V *H ) dS 
k 2 (H) = (4.16) 

S H ' dv 

The orthogonality and normalization conditions are 

J H„.H„* = 0 Jh„.H„ dv = l (n = m) 

Furthermore, we shall assume that the eigenfunctions are real and belong 
to one of the following three groups: 

I. VxH m = 0, V • H m = 0 

II. VxH m #0, V • H m = 0 

III. VxH„, = 0, V • H m 7^ 0 

A piecewise-continuous and square-integrable, but otherwise arbirtary, 
vector function F, defined in V, can be expanded in terms of these functions: 

00 

F= Z B m U m (4.17) 

m= 1 

where 

B m = j F-H m dv (4.18) 

We have obtained two complete sets of orthonormal functions, the E ’s 
and the H m ’s, but those belonging to group II are closely related to each 
other. To show this, we first note that their eigenvalues are not equal to 
zero, as can be seen from the variational expressions. Using E fl , which is 
one of the E„’s belonging to group II, we can define a function H a through 

V x E a = k a H„ (4.19) 

where k a is the nonzero eigenvalue of E a . Therefore, H a belongs to group II 
and can be adapted as one of the H„’s to form a complete set for the follow- 
ing reasons. Since V-H fl = 0, V *E a = 0, and 

V x V x H a - k a 2 H a = k; 1 \ x (V x V x E a - k 2 E.) = 0 (in V) 

the function H a satisfies (4.14). On S, it satisfies 

n x V x H fl = k~ ‘n x V x V x E a = k;'n x k a 2 E a = 0 (on S) 
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which is one of the boundary conditions in (4.15). For the other boundary 
condition, let us consider the surface integral of n*H a over AS, an arbitrary 
part of S. It is given by 

I n-H „dS = k;' f n • V x E a dS = 1 f E a -dl 

J AS J AS J AL 

where AL is a closed contour around AS. Since the component of E a 
tangential to S is equal to zero, the right-hand side vanishes regardless of the 
size and shape of dS*. This means that 

n • H a = 0 (onS) 



Thus, H a satisfies (4.14) and (4.15). Furthermore, for H a and H b corre- 
sponding to E a and E b , respectively, we have 



| H a H b dv = {k a k b y l J V x E a . V x E ft dv 



= (*.**) 1 J n 



x E a ■ V x E b dS + 



(hlK)j 



E„ • E fc dv 



The first term on the right-hand side is equal to zero since n x E a = 0. If 
a^ b, the second term also vanishes due to the orthogonality relation between 
E a and E b . If a = b, the integral becomes unity. These two conclusions mean 
that the H a ’s satisfy the orthonormal conditions when the E a ’s do. We have 
seen that an eigenfunction H a belonging to group II can be derived from 
each eigenfunction E a in group II. Conversely, if H a belongs to group II, 
E a can be defined through 

V x H a = k a E a (4.20) 



As a result, every eigenfunction belonging to group II can be made to have a 
counterpart in the other set without loss of completeness. Let us assume that 
this has been done in the remainder of the discussion. 

In order to distinguish eigenfunctions in groups I and III from those in 
group II, Greek subscripts will be used, i.e. ; 



V x E v = 0 (4.21) 

VxH^O (4.22) 



We are now in a position to solve Maxwell’s equations using a strategy 
similar to the one employed in Section 3.5. First, we shall expand all the 
quantities appearing in Maxwell’s equations in terms of appropriate sets of 
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functions obtained above. We shall then determine the coefficients by sub- 
stituting the results into Maxwell’s equations, thereby obtaining the electric 
and magnetic fields in expanded forms. 

If there were no openings or losses in the walls, the E a ’s and the H a ’s 
would suffice to define the electric and magnetic fields. However, because of 
these imperfections, all the functions, the E a ’s, E v ’s, H a ’s, and H/s, become 
necessary for our discussion. For simplicity, let us first consider a cavity 
with only one opening connected to a waveguide. In order to facilitate the 
connection of the fields in the cavity with those in the waveguide, let us 
extend the cavity region V into the waveguide, as shown in Fig. 4.3 where 




Fig. 4.3. Cavity volume V is extended to reference plane A'o. 

S Q is a cross section of the waveguide defining the extent of V. Let S' be the 
cavity wall surface so that S' + S 0 forms a closed surface S completely 
enclosing V. 

Maxwell’s equations, which we are going to solve, are given by 

V x E = - jcop H (4.23) 

V x H = (o + jcne) E (4.24) 

On the right-hand side of (4.24), a has been introduced for several reasons. 
Although we shall assume that a joe , if a is finite, the effect of <r becomes 
important at and near the resonant frequencies and hence cannot be neglect- 
ed. Furthermore, use will be made of the fact that k a 2 = c ofep is real and, 
hence, the replacement of e by e + o/ja) in the final result will not give a 
correct answer as it did in the case of waveguides. 



178 



4. RESONANT CAVITIES 



The expansions of E and H are given by 



E = £E„ j E-E,* + i;E, Je-E, A; 
H = £ H„ | H-H. dv + £ H, J H H ; 



(4.25) 



Next, considering that V x E is an H-like function, let us expand it in 
the form 

V x E = EH fl J V xE.H fl du + J V x EH x dv (4.27) 

The first expansion coefficients on the right-hand side can be rewritten as 
follows : 

J V x E-H a dv = J (E.V x H a + VE x H a ) dv 
= k a J E -E a dv + J n x E H a dS 
Similarly, since VxH A = 0,we have 

J V x E • H a dv = J n x E • H A dS 

Substituting these expression sinto (4.27), the expansion of V x E becomes 

V x E = Y U a (k a f E‘E a dv+ f n x EH a dS ) + £ H x f n x E-H^dS 
* V J J J 1 J (4.28) 

If we took V x (4.25) and calculated the term-by-term differentiation of the 
right-hand side, the surface integrals in (4.28) would not appear, thus leading 
to an incorrect result. This is due to the fact that the term-by-term differen- 
tiation of an infinite series is not always allowable, especially when the bound- 
ary conditions of the functions in the series expansion are different from 
those of the function being expanded. 

Finally, let us expand V x H in terms of the E a ’s and E v ’s. A little manip- 
ulation similar to that leading to (4.28) gives 



VxH = Z E A J »H a 



(4.29) 



The surface integrals corresponding to the ones in (4.28) do not appear 
because n x E a and nxE, are both equal to zero on S. 
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Since all the necessary quantities have been obtained in their expanded 
forms, let us substitute them into (4.23) and (4.24). The results are given by 





L H * J E ‘ £ fl dy + JnxE»H a ds'j + £ H A J n x E*H A dS 






= -jw(T H a Jh-H^u + XH, J n , H dv ^j 


(4.30) 


r 


IEA /h*H„ dv 




i 

\ 

j 


= (o+j<oe)(j?E a ^ E-E a dv + £ E v J E • E v dvj 


(4.31) 


• 


Equating the coefficients of each eigenfunction on both sides of these equa- 
tions we obtain 


> 

.> 

ji 


k a j E E a dv + J n x E H a dS = - jop j H-H a dv 


(4.32) 


V j 

|( 


Jn x E *H a dS = - jap J H • H A dv 


(4.33) 


■i ' 


K J H • H a dv = (a -J- joe) J E • E a dv 


(4.34) 


| ; 


j E*E V dv = 0 


(4.35) 




If we eliminate one of the volume integrals from (4.32) and (4.34), say 
J E • E a dv, we obtain 


j 


{jop + k a 2 l(o + joe)} j H*H a dv = -JnxE-H a dS 


(4.36) 



Now suppose n x E is given on S, then J H*H a dy can be calculated from 
I ( 4 - 36 )- Substituting this into (4.34), j E*E a dv will be determined. Further- 

I more ’ J H-H a dv can be calculated from (4.33) while { E-E v dv is equal to 

| > 2ero from (4-35). Substituting all these volume integrals into (4.25) and 

(4.26), the electric and magnetic fields are obtained in their expanded forms 
f. f ■ and Maxwell’s equations are solved assuming n x E is known. To complete 

the solution our next task is to find n x E on S = S 0 + S'. First consider 
n x E on S 0 . Since S 0 is a cross section of the waveguide, the component E ( | 
of E tangential to S 0 can be expanded in terms of the eigenfunctions in the 
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waveguide : 






(4.37) 



where V„ is the expansion coefficient which can be interpreted as the voltage 
associated with the nth mode in the waveguide as discussed in Section 3.5. 

Next, to calculate the surface integral on the right-hand side of (4.36), 
let us expand -k x H„ on S 0 in terms of the E,/s: 

-kxH„ = ££,„/„„ f 

n 

where I a „ is the expansion coefficient. Multiplying both sides by k x and 
using the relation 



we have 



k x k x H fl = k(k H a ) - H a (k-k) = - H a 
H« = E k x E tn / an 



(4.38) 



Noting that the direction of k is opposite to that of n, we obtain from (4.37) 
and (4.38) 

- f nxE*H a rfS= f k x E • H a dS 
J S 0 J Sq 

“f ZkxE,„F„.XkxE,„7„<iS 

J So n n 

= I KL (4.39) 



where a formula similar to (3.85) is used. In much the same way, H A can be 
expressed in the form 

h a = I k x EJ X „ (4.40) 

n 

and the contribution from S 0 to the surface integral on the left-hand side 
of (4.33) becomes 



f nxE.H^S = -^F./ lt 

J So n 



(4.41) 



Having obtained the integrals over S 0 , let us now turn our attention to 
the wall surface S'. For simplicity, we assume that (o is close to co p = 
k p (e.p)~ 1/2 where k p is one of the kfs and that only the pth mode is strongly 
excited. All the other terms in the field expansions will be considered small 
compared to this mode. Then H can be replaced by H p J H*H p dv in order 
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to calculate the effect of the wall losses. As we discussed in Section 3.8, the 
boundary condition on S' is given by 



n x E = ZJl 



and, hence, we have 



n x E = Z^Hp 



fn.H, 



(4.42) 



under the assumption mentioned above. From (4.42), the surface integral 
of n x E*H P over S' becomes 



J s n X E • H„ dS = (1 + j) 1 j H-H p dv (4.43) 

where 

Qsp = (1 + j )~ ' («V<r 1 £ Z„ H„ 2 dS (4.44) 

Substituting (4.39) and (4.43) into (4.36), with a =p and assuming yens > o, 
we obtain 

i k p 2 ~ 0)2 e k + J 03 ^ + 0 + J ) J 038 WpkQsp 1 } f H • H p dv= jeos J VJ pn 

J n 

(4.45) 

Dividing both sides by j(oa) p ep and rearranging the terms, when co is close 
to to p the above equation reduces to 

y vi 

1 ’ n pn 

H • H p dv = (4 46) 

(OpP j {(o)lo p ') - (oiplw)} + (1/e/) 



where 

< = (u p( 1 - Msp l )> (i/e;) = (i/e P ) + (i/e SP ), (i/e P ) = wm 

(4.47) 

Since the first and second terms in the brackets on the left-hand side of 
(4.45) almost cancel each other, the contribution from (4.43) to the left-hand 
side of (4.45) cannot be neglected in obtaining (4.46). On the other hand, since 
Z w is small, the contribution from the surface integral of n x E*H a (a ^ p) 
over S is negligible compared to the other terms in (4.36). Thus, we have 



H.H„rfD = A. 

J (OAl i 



i I KL 

n 

o>Jij{(a>l(o a ) - (a >J(o)} + (1 IQ a ) 



(4.48) 
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where a is not equal to p. Similarly, we obtain from (4.33) and (4.41) 



f £ yjx 

H.H \dv = - 

J Jm 



Substituting (4.46), (4.48), and (4.49) into (4.26), the magnetic field in the 
cavity is obtained: 

, I V Jm 



H = H. 



¥i{(»W - K'/®)} + (i/e;) 
, Z KL 



, Z KL Z E„/,„ 

_l_ V H _ h y H- - 

“ <o a pj {(m/coj - (cojco)} + (1/0.) T * jcop 



(4.50) 



Similarly, E is given by 



>-/ V ■ 






\ £ / l r mi {(®/®;) - («;/«>)} + (i/e;) 

, Z v.4. 1 

°*p “copj {(w/w 0 ) - (©./to)} + (l/0.j. 



(4.51) 



This completes the solution of Maxwell’s equations. 

Let us next calculate the input admittance of the cavity, looking in from 
S 0 . To do so, we assume that S 0 is far from any waveguide discontinuities 
and that only one propagating waveguide mode exists there; i.e., all the 
other modes are negligibly small. From (4.38), (4.40), and (4.50), the 
tangential component of H on S 0 is given by 



H„ = Z Z k X E . -- + Z Z k X E, 

« "■ 0J p pj {((olajp) - (co p /(O)} + (1/0 p ) „ m 



Z L. — 7 
L «*p o) a pj 



- (cojco)} + (1/0.) 



A J0)p J 



(4-52) 



If subscript 1 indicates the propagating mode in (4.52), the terms other than 
those corresponding to m = n = 1 must all be negligible, by hypothesis. On 
the other hand, the same magnetic field can be written in the form 



Hu — k x E fl 7, 



(4.53) 



where /j is the current associated with the propagating mode. Therefore, 
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/j is given by 






/, = JlEl 

<o p pj {(«/<) - «/m)} + ( 1 /e;) 



U °>aHJ 



(W - W©)} + (1 IQa) L M l 



y vi 
L 



Since V i is the voltage associated with the propagating mode in the wave- 
guide, the input admittance of the cavity becomes 



where 



Y= h = J 

V 1 ™ p pj {(cojco p ') - «/©)} + (1/e/) + Yp (4 ' 54) 



Li °>J*J 



L j {(©K) - ((OJCO)} + (1/e.) L jCOfi 

a*p A 



h) UJI s 



(4.55) 



The term Y p is a slowly varying function of co in the vicinity of co = co p in 
contrast to the first term on the right-hand side of (4.54). 

Corresponding to (4.54), for resonant cavities with two openings, such 
as that illustrated in Fig. 4.1, we obtain 



h = Y n V 1 + Y l2 V 2 , I 2 = Y 12 V i + F 22 F 2 



(4.56) 



where 



co p p j {(cojco J) - (©;/©)} + (1/e/) 

+ y J_ Ual Vhihj 

L - (©./©)} + (l/e.) + [j jcop ^ 

o ? P J 

(4.57) 

Equations (4.54) and (4.56) will be studied in detail in the next section. 

The electric and magnetic fields of the pth mode in the cavity, [E] and 
[H] p , are given by 

//A 1/2 1 £ I P ” 



[E] p = -7 



[H] p = 



£ / (opj {(cojco f) - (c ofjco)} + (l/e;) 

£ VJpn 



co P nj {(co/co;) - (»;/»)} + (i/e;) 
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Therefore, the relation 

i J ‘ [E]„ • [E ] p * do = i J „ [H] p • [H] p * dv (4.58) 

holds at co = a) p . This means that the average electric and magnetic stored 
energies of a resonant mode are equal at the resonant frequency. Since E p 
and H p are real, [E] p and [H] p are 90° out of phase. When the electric 
stored energy becomes maximum, the magnetic stored energy becomes zero 
and vice versa. However, the sum of these stored energies always remains 
constant at the resonant frequency. This sum is twice as large as the average 
electric or magnetic stored energy. 

As defined in (4.47), Q p ' has the same physical meaning as the ordinary 
Q of a resonant circuit in low frequency ranges. This can be shown by re- 
writing 1 jQ p as follows 



1 


<r 1 1 f 

= | 1 7 II 2 Jr 




e; 


(O p E (O p nl+jJ 






J *[E],-[E],* dv Re j 


Z w [H] p .[H] p * dS 




J £ [ E ] P ‘ [ E ]p* dv o) P 


j* d [H] p • [H] p * dv 



(power loss in medium) 

2 (o p (average electric stored energy) 

(power loss in wall) 

2co p (average magnetic stored energy) 

(total power loss) 

(o p (total electromagnetic stored energy) 

In order to obtain (4.50), an assumption was made that only the />th 
mode was strongly excited in the cavity. However, if the mode is degenerate 
or if there are one or more tu„’s close to (o p , this assumption is no longer 
valid. In this case, several modes must be assumed, having the same order of 
magnitude, and whose resonant frequencies are close to the frequency 
under consideration. The right-hand side of (4.42) becomes a linear combi- 
nation of the contributions from all of these modes, and corresponding to 
(4.45), we obtain simultaneous equations from which the expansion coeffi- 
cients can be calculated. This situation is similar to the degenerate case in 
Section 3.8 in which the effect of waveguide wall losses is discussed. 
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4,3 Equivalent Circuits 

In the previous section, the input admittance of a cavity with one opening 
was calculated and given by (4.54). If <r = 0, all the 0/QJs vanish and Y p 
must become pure imaginary. In practice, however, a finite conductance may 
appear due to wall losses. This follows even though the conductance of each 
term in Y p attributable to the wall losses is negligible compared to the sus- 
ceptance of the same term. When the summation of an infinite number of 
terms is carried out, the conductance components add to a small, but finite, 
contribution while most of the susceptance components cancel each other. 
The effect of wall losses cannot be calculated by the simple approximate 
method we have used. However, the following discussion will not be affected 
since no assumption will be made as to the origin of the real part of Y p . 

For simplicity, let us assume that Y p is a constant since it varies slowly 
with to in the vicinity of co p , the frequency of interest. The inverse of the 
first term on the right-hand side of (4.54) expresses the impedance due to 
the resonant mode. Since the real part remains constant as c o varies, the 
locus of the impedance on the Smith chart must therefore be a constant 
resistance circle. The admittance locus, i.e., the inverse of the impedance, 
is a circle tangent to the periphery of the Smith chart where the reflection 
coefficient r—- 1 as shown in Fig. 4.4(a). Adding Y p to the circle, it follows 
from the property of bilinear transformations that the locus of Y becomes 
another circle as shown in Fig. 4.4(b). Let d p be the angle between the zero- 
susceptance line and the straight line passing through Y p and the center of 
the Smith chart. Furthermore, let co 0 be the angular frequency corresponding 
to the intersection of the admittance locus with this straight line as shown 
in Fig. 4.4(b). If the reference plane S 0 , at which the input admittance is 




Y p 

(a) (b) 

Fig. 4.4. Construction of the locus Y on a Smith chart. 
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calculated, is shifted toward the generator by 

L p = (9J4n)X g 

the locus as a whole rotates clockwise by 6 P around the center of the Smith 
chart. As a result, the input admittance locus from the new reference plane 
may look like the circle shown in Fig. 4.5. Strictly speaking, the wavelength 
X g in the guide varies with co. However, X g is assumed to be constant since 
we are considering a narrow range of co in the vicinity of io p . 



Fig. 4.5. Input admittance at the new reference plane So. 




With the new S 0 plane, the region V of the cavity is no longer the same 
as before, and the resonant frequencies together with the eigenfunctions 
are different from the original ones. However, the general expression for 
the input admittance should remain unchanged and is given by (4.54). As 
we can easily see from Fig. 4.5, co p and Y p for the new cavity must be given 
by (o 0 and a small conductance G, respectively. Thus, the normalized input 
admittance becomes 



— = + — (4.59) 

Y 0 j {(©/©o) - (© 0 /©)} + (1/Co) Y 0 

where the external Q and the unloaded Q are defined through 
1 IQ e « = I 2 P il<o p nY 0 , l/0o = l/2/ 

for the new cavity, and Y 0 is the characteristic admittance of the propagating 
mode in the waveguide. The admittance Y can be considered as a parallel 
connection of a series resonant circuit and the conductance G. Comparing 
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the inverse of the resonant term 

j{(coK)-( rao H} e ^ + - e "‘ 

* o 'J.qYq 

and an ordinary impedance expression for a series resonant circuit 

j{a)L - (1 /©C)} + R 

we obtain an equivalent circuit of Y as shown in Fig. 4.6 where 

L = Q'J<°oY 0 , C= Y 0 /<a 0 Q,„, R = Q'JQ 0 r 0 (4.60) 




Fig. 4.6. Equivalent circuit of Y. 



Let us consider the voltage at the original plane S 0 or at a reference plane 
distant nXJ2 from it, where n is a small integer. As a function of co, this 
voltage changes proportionally to the straight-line distance from P to the 
point corresponding to co on the locus in Fig. 4.4(b). Therefore, if© is varied, 
while the incident voltage remains constant, the voltage at this plane should 
vary as illustrated by the curve in Fig. 4.7(a). On the other hand, the voltage 
as a function of co at the new S 0 plane, or nXJ2 away from it, should look 
like the solid curve in Fig. 4.7(b). Consequently, the new S 0 plane can easily 
be determined with a standing wave detector. Note that the new S 0 plane 
is a voltage maximum point at frequencies far from resonance. The dotted 
line in Fig. 4.7(b) shows the voltage at a reference plane XJ4 away from the 
new S 0 plane. This voltage is proportional to the straight-line distance 
from the point 0 to the point corresponding to co on the locus in Fig. 4.5. 
When the center of the Smith chart is inside the locus, the solid and dotted 
curves overlap each other as shown in Fig. 4.7(b); however, if the center is 
outside, the peak of the dotted curve becomes lower than the valley of the 
solid curve and no overlapping takes place. For a reason which will become 
clear after the discussion of power, the cavity is said to be overcoupled when 
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(O) ( bl 

Fig. 4.7. Voltage versus co at the reference planes. 



the locus includes the center of the Smith chart, and undercoupled if it does 
not. 

We next consider the position of the voltage minimum as a function of co. 
In the case of overcoupling, referring to Fig. 4.5, the straight-line distance 
from P to the point corresponding to co 0 + Aco on the locus is minimized 
when the locus as a whole is rotated counterclockwise around the center of 
the Smith chart through the angle 0. The counterclockwise rotation of the 
locus through 6 means that the reference plane is shifted toward the load, 
a distance L = (0/4n) k g , and the length being minimum indicates that the 
voltage at this new reference plane is minimum. Thus, the position of voltage 
minimum at co=co 0 + Aco is located L = (9/4n) k g toward the load from the 
voltage minimum point for <w 0 - In the above procedure, instead of rotating 
the locus, P can be rotated in the opposite direction through 6 to P'. The 
straight-line distance from P' to the point corresponding to co 0 + Aco on 
the locus is then minimized indicating that the voltage minimum point for 
ct) 0 + Aco shifts L = (9/4n) k g toward the load from the minimum point for 
co 0 . Repeating a similar procedure for each co in the vicinity of co 0 , the 
position of voltage minimum versus co can be plotted as illustrated in Fig. 
4.8(a). In the case of undercoupling, the locus does not enclose the center 
of the Smith chart, and the position of voltage minimum remains in the 
vicinity of the point O, shown in Fig. 4.5. The position of voltage minimum 
then changes with co, as shown in Fig. 4.8(b), which is quite different from 
Fig. 4.8(a). 
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1 — + -\~. 



i i 

i i 



w 0 -Au> ( cu 0 u> 0 +Ao>| oj 



w 0 -Au>, m 0 o; 0 +Aa* ( u) 



Fig. 4.8. Voltage minimum point versus co (a) overcoupling; (b) undercoupling. 



The value of G/Y 0 can be obtained from the standing wave ratio at 
frequencies sufficiently away from resonance (co 0 ) that the first term on the 
right-hand side of (4.59) can be neglected compared to the second term. 
The frequencies should not be so far away that the value of Y p is affected 
by the resonances of other modes. On the other hand, the value of Q 0 !Q ext + 
+ GfY 0 can be obtained from the standing waves ratio at co 0 . From these 
two values, we calculate 

Yi _ Qo G 

To e„,(i+;) + r„ (4 ' 61) 



and obtain the straight-line distances from P or O to YJY 0 on the Smith 
chart, and hence the relative voltages at frequencies to, =co Q ± Aco x corre- 
sponding to YJY 0 . The frequency Aco x can be determined by measuring the 
voltage versus co at an appropriate reference plane as shown in Fig. 4.7(b) 
and finding the frequencies at which the relative voltage becomes the value 
obtained above. Similarly, if 9, corresponding to YJY 0 is obtained on the 
Smith chart, the same frequencies co 0 + Aco x can be determined by taking 
two points distant L x = (0 l /4n) k g from the point of symmetry on the mea- 
sured curve of the voltage minimum point versus co as shown in Fig. 4.8(a) 
or (b). Once Aco x is obtained, Q 0 can be calculated from 

Qo = (co 0 /2 Aoj x ) 

since a comparison of (4.59) and (4.61) shows that 

I/Go = IKM)) — (coo/co^j ~ (2 Aco l /co 0 ) 



(4.62) 
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Furthermore, Q ext can be determined from the measured values of G/Y 0 
and Q 0 !Q eM + G/Y 0 . Thus, all the parameters necessary to determine the 
equivalent circuit shown in Fig. 4.6 are experimentally obtainable. 

Neglecting G, which is usually small, and assuming the generator admit- 
tance is equal to the characteristic admittance of the waveguide propagating 
mode, the equivalent circuit including the generator admittance is represented 
by Fig. 4.9. The Q of the circuit on the right-hand side of S 0 is given by 
Q 0 = co 0 LIR. The Q of the whole circuit including Y 0 is called the loaded Q 
and is indicated by Q L \ and \/Q L is calculated as 



1 = R + 0 IJQ = 1 J_ 

Ql Qo Qext 



(4.63) 



Fig. 4.9. Equivalent circuit of a cavity including the generator admittance. 





Consequently, Q ext can be interpreted as the Q corresponding to the power £ 

loss into Y 0 through the S 0 plane. If Q ext is smaller than Q 0 , the energy } 

escaping to the outside through the S 0 plane is larger than the energy > 

consumed inside the cavity, and hence the cavity is overcoupled. On the i 

other hand, if Q ext is larger than Q 0 , the cavity is undercoupled to the f 

outside. The center of the Smith chart is inside or outside the locus depending {■ 

on whether the cavity is over- or undercoupled, respectively. If the effect of ;f 

G is taken into account, the situation is different. Generally speaking, £ 

however, the cavity is said to be overcoupled whenever the locus includes 
the Smith chart center and is otherwise undercoupled. These two different 
states can easily be distinguished by performing an experiment similar to :: 

the one leading to Fig. 4.7(b) or Fig. 4.8. 

Let us next discuss the equivalent circuit of a two-port resonant cavity. j 

( 

l 
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To simplify the discussion, we restrict ourselves to the case in which the 
coupling between the input and output is due to one single resonant mode. 
In this case, the contribution to Y 12 comes from the first term on the right- 
hand side of (4.57), and other terms become zero. Furthermore, if the S 0 
plane in each waveguide is shifted to a voltage maximum point for frequencies 
far from resonance, as we did with the one-port cavity, Y u as well as Y 22 
is represented by a resonant term plus a small conductance. We neglect the 
effect of the small conductances in Y u and Y 22 . Equation (4.56) then 
becomes 



hy lpi 

"0^7 {(a>K) - (a>o/fi>)} + (1/Go) 

1 Wo 2 

a> 0 /i j {(«/«„) - (wolco)} + (TJQo) 



‘‘r -■ * '.) 

K, + Y ^ 

'01 / 



(4.64) 



If port II is short circuited, for example, by setting V 2 = 0, the input imped- 
ance of the resonant cavity from port I becomes a series resonant circuit. 
Similarly, if port I is short circuited, the cavity appears to port II as a series 
resonant circuit with the same resonant frequency as before, but the magni- 
tude of the impedance is different. Therefore, an equivalent circuit of the 
cavity may look like Fig. 4.10, where a series resonant circuit Z is connected 
to the input and output circuits through transformers. 

Let us study the relation between the currents and voltages in Fig. 4.10. 
The current 7 0 flowing through Z is driven by the difference of electromotive 
forces n 1 V 1 and n 2 V 2> and we have 

/o ~ Z -1 (hj F j — n 2 V 2 ) 

From this, I t is given by 

h=n l I 0 = Z- i {n l 2 V 1 -n,n 2 F 2 } 




1 : n i n 2 : i 

Fig. 4.10. Equivalent circuit of a two-port resonant cavity. 
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Similarly, I 2 becomes 

h — ~ n 2?o = Z 1 { — w i w 2 + t*2 2 V 2 } 

Comparing these equations with (4.64), if the equivalent circuit is to represent 
the two-port cavity under consideration, we obtain 



Z = \l( m + ‘1 010,1 

l V^O ^ / Qo) ^01^02 

n l ~ Ooilhl) 112 > n 2 = ~ (A)2/^0l) ,/2 
and Z is a series resonant circuit of L, C, and R given by 



L — fijl 01/02 5 c — 1/g>0 L ^ i? — o)qLIQ 0 (4.65) 

respectively. 

Now, we are in a position to calculate the output power as a function of 
c o using this equivalent circuit. For simplicity, both generator and load are 
assumed to be matched to the waveguide characteristic impedances. The 
equivalent circuit, including the generator and load, is therefore given by 
Fig. 4.11(a) which becomes Fig. 4.11(b) when the transformers are elimi- 
nated. From Fig. 4.1 1(b), the output power is calculated to be 



P L {o>) = n 2 R L \I 0 [ 



n 2 2 R L \n 1 E 



\n x z R g + n 2 2 R L + R + jo> 0 L {(w/a> 0 ) - (a> 0 /co)}| : 




If Qex m an d Qext, 2 are defined by 

l/<2ex.,i = n^RJcooL, 1 IQ cxu2 = n 2 z RJo) 0 L (4.67) 

then, these external Q's have a physical meaning similar to the external Q 
of one-port resonant cavities; i.e., and Q ext 2 are the 0’s due to the 

power losses through ports 1 and 2, respectively. The loaded Q is then 
calculated from 

(l/ej = (1 IQ..,. ,) + 2 ) + (l/Qo) (4.68) 

In terms of Q L , (4.66) can be rewritten in the form 

P ( 02 ) = n 2 n 2 2 R L E 2 Q L 2 ^ P L (co 0 ) 

L {oy 0 Lf |1 + jQ L {(a>/(o 0 ) - ((OqIco)} \ 2 1 + Q L 2 (2 Aq)/(o 0 ) 2 

(4.69) 

where Aco = co — (o 0 . The output power as a function of co becomes a 
typical bell shape as shown in Fig. 4.12. The difference of the frequencies 
at which the output power decreases to the one-half of the maximum is 
given by 2 Aco= a> 0 IQ L . This relation is often used for the measurement 

of 




Fig. 4.12. Output power versus cx> of the two-port cavity. 

4.4 Perturbation of Boundaries 

Suppose the cavity wall is slightly deformed from S' to 5" as shown in 
Fig. 4.13, where the deformed surface is indicated by AS. The volume of 
the cavity is now V — A V, where AV is the volume of the deformed part. 
The eigenfunctions as well as the eigenvalues will now all be different from 
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Fig. 4.13. A small perturbation of the wall of a cavity. 



the original ones. As a result, J pl , co p ', and Y p in (4.54) are expected to change. 
However, since the denominator of the first term on the right-hand side al- 
most cancels in the vicinity of a > p ', the largest effect on the input admittance 
comes from the variation in co p . For this reason, the variation of (D p due 
to the deformation is particularly worth studying. However, since oj p is 
directly proportional to co p and Q Sp is expected to remain almost the same, 
we have only to investigate the variation of o) p , or k p z which is equal to 
c o 2 vfi , in order to study the co p variation. 

Let us use the variational expression for k 2 to investigate the variation 
of k 2 . Noting that the expression gives an approximation for the eigenvalue 
one order higher than that of the trial function used, we use the original 
eigenfunction E p as a trial function. Then, k 2 (E p ) is given by 
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f ( v xiy 

J AV 



2 dv + k 2 E 2 dv 



n x E p • V x E p dS 



where (V-E p = 0) and the normalization condition of E p are used. The last 
surface integral is equal to the integral over dS since n x E p = 0 elsewhere. 
However, using Gauss’s theorem and n x E p = 0 on S', we have 



x E p • V x E p dS 



= - I V *E p x V x E p dv 

J AV 

= - f {(VxE p ) 2 -E p .Vx VxE p } 

J AV 

= -f (VxE p ) 2 ^+f k 2 E 2 dv 

J AV JAV 



Combining the above two equations, we have 



k 2 (E p ) * k 2 + f (V x E p ) 2 dv - k 2 f E p 2 dv 

J AV J av 

It follows from (4.19) that this is equivalent to 

k 1 (E„) = k, 2 |l + (H p 2 - E p 2 ) (4.70) 

Equation (4.70) gives the approximate eigenvalue for the deformed cavity. 
Writing the variation in the resonant frequency by Aa > p , we obtain 



Aco p k 2 (E p )-k f 



= ±| (Hp 2 — E p 2 ) dv (4.71) 

J AV 



This can also be considered to give Ato p 'lo) p '. From (4.71) we see that the 
resonant frequency increases or decreases depending on whether the mag- 
netic-stored energy is larger or smaller than the electric-stored energy in the 
region removed from the cavity by the deformation. In other words, if the 
wall is pushed in where the electric or magnetic field is concentrated, the 
resonant frequency will be lowered or raised, respectively. This corresponds 
to the behavior of an LC resonant circuit where the resonant frequency 
goes down if we bring the electrodes of the capacitor closer together; 
whereas, if we insert a piece of metal inside the coil, the inductance decreases 
and the resonant frequency goes up. 
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In the above discussion, when E p was used as a trial function, it was tacitly 
assumed that E p was not a degenerate eigenfunction. If it is, a linear combi- 
nation of all the eigenfunctions which are degenerate to E p must be used 
in place of E in the variational expression, and the coefficients have to be 
determined so as to make k 2 ( E) stationary. Each stationary value of k 2 ( E) 
gives an eigenvalue from which the resonant frequency can be calculated. 
The situation is somewhat similar to the degenerate case discussed in 
Section 3.8. 



4.5 Cavities with Inhomogeneous Media 

In this section, let us discuss briefly how to obtain the equivalent circuit 
of resonant cavities with inhomogeneous media. Since the two sets of eigen- 
functions obtained in Section 4.2 are complete, it is possible to expand the 
expressions for electric and magnetic fields in a cavity with an inhomogeneous 
medium in terms of these sets of eigenfunctions. If this is done, however, it 
becomes impossible to assume that only one or a small number of modes are 
excited strongly, and that others are negligible. In other words, the conver- 
gence of the series becomes so poor that no useful information can be ob- 
tained. Consequently, some other sets of functions have to be found which 
are suitable for the expansion of electromagnetic fields in such cavities. 
Fortunately, the following eigenvalue problems give suitable sets of eigen- 
functions provided that e r — e/e 0 and p r — nlfi 0 remain positive and finite 
everywhere in the cavity. 

Vx^’VxE- E r VV -e^ - k\E = 0 (in V) 
nxE = 0, V*e r E = 0 (on S) 



V x e; 1 V x H - - k 2 n,li = 0 (in V) 

n • fi r H = 0 , n x e“ 1 V x H = 0 (on S) 



(4.73) 



Under the assumption that e, and p, are positive and finite, k 2 becomes real 
and nonnegative, and the eigenfunctions can be chosen to be real, without 
loss of generality. The variational expressions for the k 2 ' s are given by 



J fi~ 1 (V x E) 2 dv + J (V • £ r E) 2 dv-2 J nxE./j, *VxE dS 

J £,E 2 dv 



(4.74) 
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fc 2 (H) = 



J £ r 1 ( v x H ) 2 dv + j (V«/i r H) 2 dv-2 j n */i r H V ^ r H dS 



J"' h: 



(4.75) 



Using these expressions, the eigenfunctions can be obtained, at least con- 
ceptionally, one by one to satisfy the orthonormal conditions 






(4.76) 



( 4 - 77 ) 

In this process, every E„ can be made to belong to one of three groups 
I. VxE, = 0, V • e r E„ = 0 

n. vxe„*o, v. £ ,e„ = o 

III. VxE, = 0, V.e,E„*0 



Similarly, we have for H„ 



l. VxH m = 0, V • /i r H m = 0 

II. VxH ffl #0, V./i r H m =0 
HI. VxH m = 0, V • n r YL m =£ 0 

Furthermore, each function belonging to group II can be selected so as to 
make a pair, E a and H a , satisfying 



V x E fl = kji r H a , V x H fl = k a s r E a 



(4.78) 



If Greek subscripts are used for the functions belonging to groups I and III, 
we have 

VxE v = 0, V x H; = 0 (4.79) 

Comparing the magnitudes of the eigenvalues for cavities with homogeneous 
and inhomogeneous media, the infinite growth of the eigenvalues of the 
present problems can be proved as shown in Appendix I. Therefore, each 
set of the orthonormal functions obtained above is complete. An arbitrary 
vector function F which is piecewise-continuous and square-integrable over 
Fean be expanded in terms of either set of these functions; i.e., 



F = I ^E„, 

w= 1 



*■= Z 

m= 1 



(4.80) 
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v 



where 

,4 n = Je r F.E B dt>, B m = 



(4.81) 



For the analysis of the cavity, the quantities appearing in Maxwell’s 
equations are to be expanded in terms of these sets of functions. We use the 
E„’s and the H m ’s to expand E and H, respectively. However, if we try to 
expand V x E and V x H, we shall find that the convergences are poor 
since they do not resemble H a and E 0 , respectively. Therefore, we expand 
(i ~ 1 V x E and s ~ 1 V x H in terms of the H m ’s and the E n ’s, instead of 
V x E and V x H. The remainder of the discussion then becomes almost 
identical to that for cavities with homogeneous media. The expression for 
the input admittance remains the same as (4.54), and the discussions in 
Section 4.3 apply equally well, without modification, to the present in- 
homogeneous case. 



PROBLEMS 

4.1 Try to prove the completeness of the eigenfunctions defined by the eigenvalue 
problem: V x V x E — k 2 E = 0 (in V) and n x E = 0 (on S). Point out where 
the discussion fails. 

4.2 Calculate the change in the resonant frequency of an LC resonant circuit due to the 
insertion of a dielectric slab between the electrodes of the capacitor as shown in 
Fig. 4.14. Use two different methods: (1) Calculating the change of the capacitance; 
(2) Using the variational expression (4.74). 




Fig.' 4. 14. LC resonator with a dielectric slab between the capacitance electrodes. 



4.3 Suppose that a piece of waveguide an integer multiple of a half-wavelength long is 
short-circuited at both ends to form a cavity. Show that the attenuation constant of 
the waveguide can be calculated from the Qsp of the cavity through a = (nkg/Qspk 2 ) 
where the wall losses in the end plates are neglected. 

'4.4 Considering a length L of a rectangular waveguide short circuited at one end as a 
cavity, obtain the H a ’s and Ha’s, explicitly. 



.V ' •? 

.4 • | 

I ! 

$ h 



I £" 

s \ 



4.5 Using the above expressions and neglecting the wall losses, calculate the input 
admittance of the rectangular waveguide (TEio mode) at the reference plane distant 
L from the short-circuited end. 

4.6 Prove that the input admittance in Problem 4.5 is equal to that obtainable by treating 
the waveguide as a transmission line. (Hint: A well-known formula 

2 

1 - ( nn/0 ) 2 

may be helpful to equate the two expressions.) 

4.7 Show that the solutions of (4.8) under the boundary conditions 

n x V x E = 0, n-E = 0 (on So) 

n x E = 0, V -E — 0 (on 5') 

form a complete set of orthonormal functions. Also show that the solutions of (4.14) 
under the boundary conditions 

n x H — 0, V -H = 0 (on So) 

n x V x H = 0, n-H = 0 (on S') 

form another complete set of orthonormal functions. The boundary conditions on 
So correspond to the open-circuited condition. 

4.8 Using the above sets of eigenfunctions, obtain the input impedance of the cavity at 
the reference plane So. 

4.9 Calculate the input admittance of a cavity with twofold degenerate resonant modes. 

4.10 Use a perturbation method, similar to the one used in Section 3.8, to obtain (4.71). 

4.11 Carry out the discussion of boundary perturbation assuming the twofold degeneracy 
of the resonant modes. 

4.12 Complete the discussion for obtaining an equivalent circuit representing cavities with 
inhomogeneous media. 




n = 1 




CHAPTER 5 

MATRICES AND 
WAVEGUIDE JUNCTIONS 



In Chapter 2, some fundamental properties of vectors were reviewed. 
It was shown that a single letter could represent three components of a vector, 
and relations between the components of various vectors were conveniently 
studied using certain rules of mathematical vector operations. The intro- 
duction of vectors provided a space saving technique for describing these 
relations, and also reduced the tremendous mental effort otherwise necessary 
to handle three times as many variables representing vector components. 

In much the same way, a matrix represents several quantities in a pre- 
scribed manner. Using certain operating rules between matrices, it becomes 
possible to describe complex relations between many quantities in an orderly 
fashion. Thus, by the introduction of matrices, the description of the input 
and output relations of complex waveguide junctions becomes simpler, 
and the understanding of their behavior becomes easier. As a result, several 
useful theorems for treating microwave circuits, which otherwise would be 
difficult to find, now become readily obtainable. 

In this chapter, we shall first present matrix analysis which has proved 
useful in many branches of applied physics, including the theory of micro- 
wave circuits. Then, using matrix notations, the reciprocal theorem, lossless 
conditions, and frequency characteristics of lossless reciprocal circuits will 
be described. Also, the properties of symmetrical Y junctions and circulators 
will be discussed in detail. 

5.1 Matrices 

„ A matrix A of order mxn is a particular collection of m x n quantities. 
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They are generally arranged in m rows and n columns as follows : 

a ll °12 ••• a ln 

^ _ a 2l a 22 ■■■ a 2n 

- a ml a m2 a mn 

and a i} is called the ij component of A. Two matrices A and B are said to be 
equal when, and only when, all the corresponding components exist and 
are equal to each other; i.e., a ij = b iJ . Therefore, when they are equal, the 
matrices are of the same order. The addition of two matrices A and B is 
defined as a matrix C with its ij component c fj - being a tJ + b tJ . Addition is 
only defined between matrices of the same order. Since a u + b tJ = b- tj + a u , 
we have 

C = A + B = B+ A (5.2) 

That is, addition is commutative; the order of addition is interchangeable 
without changing the result. 

The product of matrices A of order m x / and B of order / x n is defined 
as a matrix C of order m x n with its ij component being 

C ij = X a ik&kj 
k 

Multiplication between matrices is defined only when the number of columns 
in the first matrix is equal to the number of rows in the second matrix. When 
matrices of order 2x3 and 3x2 are multiplied, we obtain a matrix of 
order 2x2, 

*12 
b 22 

£>32 

Here, for example, the elements of the second row in A are multipled by 
corresponding elements of the first column in B, and added together to 
yield the 21 component c 21 of C; a 21 b n + a 22 b 2i + a 23 b 31 = c 21 . From the 
definitions of addition and multiplication, it follows that 

A(B + C) = AB -f AC 






(5.3) 
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This is because the ij component of the matrix on the left-hand side is given 
by 

Z a ik {b kj + c k j) = Z a ikb k j + z a ik c kj 

k k k 

which is exactly equal to the sum of the ij components of the matrices on 
the right-hand side. Similarly, when multiplying a matrix by the product of 
two matrices, we have 

A (BC) = (AB) C (5.4) 

where the parentheses indicate the first operation to be performed. It is 
worth noting that AB is not necessarily equal to BA. 

A zero matrix 0 is a matrix whose components are all equal to zero. It is 
different from zero which is usually indicated by 0. However, when 0 
appears in matrix equations, it may mean a 0 matrix. The product of a 0 
matrix and an ordinary matrix makes another 0 matrix: 

AO = 0, 0A = 0 (5.5) 

Although we use the same symbol 0 for all zero matrices, they may not be 
the same matrix; the numbers of rows and columns may be different for 
different 0’s in (5.5). 

A matrix with the same number of rows and columns ( m ) is called a 
square matrix of order m\ it is really a matrix of order m x m. A unit 
matrix I is a square matrix whose main diagonal components are each 
equal to unity while all the other components are zero. We have 

IA = A, AI=A (5.6) 

In the multiplication of a matrix with a constant, all the components of the 
matrix are multiplied by the constant. Thus, cl is a square matrix with each 
of its main diagonal components equal to c while the remainder are zero. 
It follows from this that 

cA = (cl) A (5.7) 

The inverse matrix of a square matrix A is indicated by A -1 , and it is defined 
as a matrix which satisfies the following relations: 

AA -1 = I, A _1 A = I (5.8) 

From this definition, we see that if the inverse matrix of a matrix exists, it is 
unique. 

Let det A be the determinant with the same components as matrix A, 
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and let A u be the cofactor of the ij component. Then, since 

detA = XMo = Z«/A7 

J « 

Z a ij A kj = 0 (5.9) 

Z a u A tk = 0 U * k ) 

I 

the ji component of A -1 is given by A- tj divided by det A. This can be easily 
checked by substituting into (5.8). When det A = 0, A -1 does not exist. A 
square matrix A without A -1 is said to be singular; when A -1 exists, A is 
nonsingular. A nonsingular matrix is a square matrix by definition. The 
inverse matrix of the product of two nonsingular matrices A and B is given 
by 

(AB) -1 = B~ 1 A~ 1 (5.10) 

This is because B _I A _1 satisfies the definition (5.8) of the inverse matrix as 
follows: 

ABB _1 A _1 - AA 1 =1, B“ 1 A _1 AB = B 1 B = I 

Similarly, for the product of three matrices, we have 

(ABC) 1 =C -1 B _1 A _1 (5.11) 

Hence, when the inverse is taken, the order of product is reversed. 

The transposed matrix A, of a matrix A is the one with the rows and 
columns interchanged. For example. 




The ij component of the transposed matrix of a product AB is given by 

Z a jkb k i = Z bki a j k 

k k 

However, since the right-hand side is equal to the ij component of B,A„ 
we have 

(AB), = B,A, (5.12) 



The order of product is reversed for the transposed matrix. In the case of 
the product of three matrices, we have 



(ABC), = C,B,A, 



(5.13) 
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If the transposed matrix of A has all its components changed to their 
complex conjugates, the matrix thus obtained is called the adjoint matrix 
of A and is indicated by A + . Just as the transposed matrix reverses the order 
of a product, so does the adjoint matrix, i.e., 

(AB) + = B + A + (5.14) 

When A + is equal to A, A is called a self-adjoint matrix. If 

A + = A -1 (5.15) 

then A is called a unitary matrix. The product of two unitary matrices is 
again a unitary matrix since 

(AB) + = B + A + = B“ J A~ 1 = (AB)“ 1 (5.16) 

where (5.10) and (5.14) are used together with (5.15). 

A constant times a vector in three dimensional space is a vector whose 
components are multiplied by the constant. The sum of vectors is defined 
to be a vector with each component representing the sum of the correspond- 
ing components in the vectors. Consequently, there is one-to-one corre- 
spondence between matrices of order 3x1 and vectors in the three- 
dimensional space. Extending this correspondence to multidimensional 
spaces, let us call a matrix of order nx 1 a vector in n-dimensional space. 
The operation of multiplying a square matrix A of order n and a matrix x 
of order n x 1 to obtain another matrix y of order n x 1, can then be ex- 
pressed as follows: Vector x is transformed into vector y by A. When A is 
a unitary matrix, we have 

y + y = (Ax) + Ax = x + A + Ax = x + x (5.17) 

where use is made of (5.15). When the components of the vector x in three- 
dimensional space are all real, x + x expresses the square of the magnitude 
of x. Extending this notion of magnitude to more general cases, a real 
quantity x + x is considered as the square of the magnitude of vector x even 
when x is an «-dimensional vector with complex components. With this 
interpretation, (5.17) shows that the magnitude of a vector remains un- 
changed during the transformation by a unitary matrix. In other words, the 
magnitudes of vectors are invariant to unitary transformations. 

Now suppose that 

y = Ax (5.18) 



and let us study how A changes during a coordinate transformation. If T 
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is a nonsingular matrix which expresses the coordinate transformation, and 
if x' and y' are the results of the transformation, then 



x' = Tx, y' = Ty 
Substituting into (5.18), we have 



(5.19) 



or equivalently 
Comparing this with 



T _1 y' =AT ! x' 



y' = TAT -1 x' 
y' = A'x' 



we see that A is transformed into 



A = TAT 



(5.20) 



The transformation expressed by (5.20) is called the similarity transforma- 
tion by T. This name comes from the fact that the form of a matrix equation 
is preserved during the transformation. For example, suppose 

AB + CDE = F 

After the similarity transformation by T, we have 

TABT' 1 + TCDET" 1 = TFT' 1 
which can be rewritten in the form 



TAT ^BT' 1 -hTCT^TDT^TET' 1 = TFT 



or equivalently. 



A B + CDE = F' 



This has the same form as the original equation. Thus, the forms of matrix 
equations are invariant to similarity transformations. 

When a vector is transformed by A, its direction generally changes, i.e., 
the ratios between components become different from the original values. 
However, there are vectors whose directions are invariant to a given trans- 
formation A. Let us try to find such vectors. Since vector directions do not 
change by constant multiplications, we look for vectors which satisfy 



Ax = 2x 



(5.21) 



or equivalently 



(A — AI) x = 0 
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where 0 on the right-hand side means a 0 matrix as we mentioned earlier. 
The above equation is equivalent to n simultaneous equations 

(a,, -*)*i +fli 2 *2 + ■■■+ a ln x n = 0 

+(«22 ”'0*2 +■■■ + a 2 lL x n = 0 
« n l*L + a„ 2*2 + -• + ( a nn ~ A) = 0 

For nontrivial solutions to exist, the determinant of the coefficients must 
be equal to zero: 

det(A — AI) = 0 (5.22) 

This is an nth degree algebraic equation for A which will generally have n 
different roots. Corresponding to the n roots, n independent vectors 
x f (/ = 1,2,..., n) satisfying (5.21) can be determined; the vector x f is called 
an eigenvector, and the corresponding root A,- is called the eigenvalue. We 
use this terminology because of the similarity between this and the eigenvalue 
problems discussed previously in connection with waveguides and cavities. 
In those problems, we essentially searched for vector functions which became 
k 2 times the original ones when certain differential operations were per- 
formed. 

Let us define /(A) by 

/(A) = detail -A) (5.23) 

Then, (5.22) can be written as /(A) = 0. The form of /(A) is invariant to the 
similarity transformation of A by T; the proof is as follows. From (5.20), 
we have T _I A' = AT" 1 . Therefore, we obtain 

T” 1 (AI — A') = (AI — A) T” 1 

Taking the determinants of both sides, we have 

detT -1 det(AI - A') = det(AI — A) detT" 1 

Since T is nonsingular and det T " 1 is not equal to zero, the above expression 
is equivalent to 

det (AI - A') = det (AI - A) (5.24) 

which is the result we wished to prove. 

Let A l5 A 2 ,..., A n be the eigenvalues. Then we have 

/(A) = (A — Aj)(A — A 2 )...(A — A n ) (5.25) 

Since /(A) is invariant, the eigenvalues are also invariant to similarity 
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transformations of A. Furthermore, since /(A) is an nth-degree polynominal 
of A, it can be rewritten in the form 

/(A) = A" — SiA" -1 +- + (-l)"s„ (5.26) 

Because of the invariance of /(A), s x , j 2 , ..., s n are all invariant to similarity 
transformations of A. In particular, s x is given by 

S l = a ll + a 22 4 b a nn = + A 2 4 h A„ (5.27) 

It is called the trace of A and is indicated by tr A. Setting A is equal to zero, 
s„ is seen to be 

s„ ~ X x X 2 ... A„ = det /I (5.28) 

Thus, tr A and det A are both invariant to similarity transformations. 

Let us now restrict ourselves to the case in which A is a self-adjoint 
matrix. Then the discussion becomes similar to that for the eigenvalue 
problems of waveguides and cavities. Let x f and X, be an eigenvector and 
the corresponding eigenvalue, respectively. Then, we have by definition 

Ax,. - A.-X; = 0 (5.29) 

Multiplying by x,- + from the left, this becomes 

x, + Ax,. — A|X f + x f = 0 (5.30) 

Next, taking the adjoint matrix of (5.29) and then multiplying by x t from 
the right, we obtain 

x i + A + x i -X*x i + x i = 0 (5.31) 

Since A is self-adjoint, the first terms in (5.30) and (5.31) are equal to each 
other. Subtracting (5.31) from (5.30), we have 

(A/* — Aj) x, + x, = 0 (5.32) 

which shows that A ( - = X-*, since Xj + Xj# 0. In other words, the eigenvalues 
of a self-adjoint matrix are real. 

Let X,. and Xj be two eigenvectors with different eigenvalues. Multiplying 
(5.29) by x/ from the left, we have 

x/AX|- A|x/x, = 0 (5.33) 

After changing the subscript i in (5.29) to j and taking the adjoint matrix, 
if we multiply by x,- from the right, we obtain 

x/ A + x, - X* Xj + Xj = 0 



(5.34) 
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The first terms in (5.33) and (5.34) are equal to each other. Subtracting (5.33) 
from (5.34), we have 

a.-V)x/x, = 0 (5.35) 

Since Xj is real, X } * can be replaced by Xj which is different from X t by 
hypothesis. Thus, it is shown that 

x/x f = 0 (5.36) 

This is called the orthogonality relation between x f and Xj. The orthogonal 
relation between two vectors in real three-dimensional space can be ex- 
pressed in the same form, as is easily seen from (2.3) and (2.6) by setting 
0 = 90°. 

Let us assume that all the eigenvectors are multiplied by appropriate 
constants so as to satisfy 

* + *i = I (5.37) 



This is the normalization condition corresponding to (3.37) or (3.74). The 
normalization process, of course, does not change x f from being an eigen- 
vector. 

A matrix of order 1 x 1 is different from an ordinary number, but treating 
this matrix as if it were a number whose value is equal to that of the only 
component, Eq. (5.30) gives 



_x i + Ax i 

A i + 

X; X; 



(5.38) 



This suggests that 



, . x Ax 
X(x)= -f— 

X X 



(5.39) 



may be a variational expression for the eigenvalues. Since A is self-adjoint, 
we have 

* (x + Ax) + x + A + x x + Ax , x 

* (*) = V- + ^T- = + = — + — — X(x) 

(x X) X + X X X 

which shows that A(x) is real, regardless of the value of nonzero x. Suppose 
that X becomes X + bX when x becomes x + <5x. Multiplying (5.39) by the 
denominator on the right-hand side and taking the variation, we obtain 

A(<5x + x + x + <5x) + <5Ax + x = <5x + Ax + x + A <5x 

A little manipulation shows that this is equivalent to 

<52x + x = 5x + (Ax - Xx) + (Ax - Ax) + <5x (5.40) 




(5.40) 
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where we have used A = A + and X = X*. If x is an eigenvector, from (5.40), 
the first order variation 5X of X becomes zero. Conversely, if bX = 0 for all 
possible first order variations <5x from x, (5.40) shows that the real part of 
<5x + (Ax - Ax) must be zero, regardless of the direction of <5x + . However, if 
Ax — Xx is not a 0 matrix, by choosing a proper <5x + , the real part of 
<5x + (Ax — Ax) can be made nonzero leading to a contradiction. Therefore, 
if SX = 0 for all possible variations <5x drom x, we can conclude that x is an 
eigenvector and that X (x) gives the corresponding eigenvalue. This completes 
the proof that (5.39) is, indeed, a variational expression for the eigenvalues. 

Once the variational expression for X has been obtained, all the eigenvectors 
can be found sequentially, at least conceptionally. For example, we first 
obtain a vector x t which minimizes A(x). Then, we obtain x 2 which minimizes 
A(x) under the restriction of being orthogonal to x 1 . Since <5A = 0 for all 
(5x, Xj is obviously an eigenvector; x 2 is also an eigenvector which can be 
shown as follows. An arbitrary vector x can be written in the form 

x = (x/x) Xj + {x - (xj + x) Xj) (5.41) 

This shows that x can be expressed as the sum of two vectors, one parallel 
to x n and the other orthogonal to Xl . In that part of <5x which is orthogonal 
to Xj, bX is equal to zero, as can be seen from the method of obtaining x 2 . 
In that part of <5x which is parallel to x 1} <5 Xl , 

<5Ax 2 + x 2 = <Sxj + (Ax 2 - Xx 2 ) + (Ax 2 — Xx 2 ) + 5x x 

Since ^ is now orthogonal to x 2 , (5 Xl + Ax 2 and X*x 2 + bx t are both equal 
to zero. Furthermore, we have 

<5xj + Ax 2 = (A <5xj) 4 x 2 = Aj <5x 1 + x 2 = 0 

Similarly, (Ax 2 ) + <5x 1 is equal to zero. As a result, <5A = 0 for <5 Xl . We can 
conclude, therefore, that bX is equal to zero, regardless of the direction of 
<$x, and hence x 2 is an eigenvector. Similarly, under the restriction of being 
orthogonal to x u x 2 , ...,x i _ 1 , if we obtain a vector x t which minimizes 
A(x), then x- t becomes the ith eigenvector. When i becomes «, the number 
of the dimensions of the space under consideration, no further eigenvectors 
can be obtained, for in the n-dimensional space, there is no nonzero vector 
which is orthogonal to n orthogonal vectors. Therefore, the above procedure 
gives exactly n eigenvectors. 

As we stated in connection with (5.37), the eigenvectors are all assumed 
to be normalized. An arbitrary vector x in the ^-dimensional space can 
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therefore be expressed in the form 

x = f (x, + x) x,. (5.42) 

i= I 

which can be checked by multiplying x f + (i = 1, 2 ,..., n) from the left. 
Multiplying (5.42) by A from the left, we obtain 

Ax = £ (x i + x)2 i x i (5.43) 

i~ i 

where Ax, = A.-x* is used. From this, A can be written in the form 

A • = t /UV (5.44) 

i= 1 

This is called the spectral representation of A where the dot signifies that 
A is an operator. When none of the eigenvalues become zero. A' 1 exists 
and is given by 

A -1 •= f 1 XiX,. + 
i= 1 

The validity of this expression can be checked by substituting into (5.8). 

Let X be a matrix constructed by the eigenvectors x 1 ,x 2 ,...,x„ of a 
self-adjoint matrix A in the form 

X = [xj x 2 ... x„] 



Note that X is a square matrix. Because of the orthogonality condition 
between the eigenvectors, we have 



X + X 




(5.45) 



Taking the determinant of (5.45), the left-hand side becomes detX + detX, 
and the right-hand side becomes unity. From this we see that detX cannot 
be equal to zero, and hence X 1 exists. Multiplying (5.45) by X 1 from 
the right, we obtain 

X + =X -1 (5.46) 

which shows that X is a unitary matrix. Multiplying A by X + and X from 
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the left and right, respectively, we have 

X + AX = X + [A,x, k 2 x 2 ... A„x„] 

o - 

. — diag \_l x k 2 ... A„] (5.47) 



where Ax,- — 2,x i is used. The symbol on the right-hand side indicates the 
diagonal matrix with diagonal components k t , k 2 ,..., k„. Using (5.46), the 
left-hand side of (5.47) is seen to be the similarity transformation of A by 
X l . Thus, all the eigenvalues of A can be obtained from the similarity 
transformation by X' 1 . 

Now suppose that x + Bx vanishes regardless of the value of x (i.e., 
regardless of the values of the components of x), then we have 

Z x * B ijXj = 0 

u 

If we assume that the components of x are all zero, except for x t and x j9 
the above equation reduces to 

xfBjjXj + x/BjtX, = 0 

Setting x t = Xj = l, this becomes 

Bij + Bji = 0 

Similarly, setting = j and Xj = 1, we obtain 

~B ij + B ji = 0 

These two equations show that B t j = Bj t = 0, and since i and j are arbitrary, 
we can conclude that if x + Bx vanishes, regardless of the value of x, B must 
be a 0 matrix. 

Next, suppose that x + Ax is real regardless of the value of x. Since 
(x + Ax)' has the same value as x + Ax, we have x + Ax = x + A + x, or equiv- 
alently 

x + (A — A + ) x = 0 

This equation holds regardless of the value of x, hence A-A + must be a 
0 matrix; in other words, A is self-adjoint (A = A + ). Conversely, if A is 
self-adjoint, x + Ax = (x + Ax) + , and x + Ax becomes real regardless of the 



/l, 0 ... 

0 X 2 ... 

0 0 ...’ 
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value of x. Thus, A is a self-adjoint matrix if, and only if, x + Ax is real, 
regardless of the value of x. 

If x + Ax is always positive for nonzero x, A is said to be positive-definite. 
If A is positive-definite, it is, of course, self-adjoint for positive numbers are 
real. It is also obvious from (5.39) that, if A is positive-definite, the eigen- 
values are all positive. Conversely, if the eigenvalues are all positive, A is 
positive-definite. This can be seen as follows from (5.42) and (5.43) 

x+Ax = I ( x+x .) M x » +x ) = £ X f |x ; + x| 2 

i i 

which is always positive for nonzero x. Thus, A is positive-definite if, and 
only if, the eigenvalues are all positive. 

Suppose that A is positive-definite, then a square matrix T can be defined 
by 

T = [Ar 1/2 Xl AJ 1/2 x 2 ...2“ 1/2 x„] 

where 2 4 #0 (i = 1, 2, ...,«) is used. It can easily be seen that T is non- 
singular since T + T = diag [2^ 1 2J 1 ...2„‘" 1 ], and detT#0. Furthermore, 
using (5.47), we have 

T + AT = I 

Conversely, suppose that T + AT forms a unit matrix for a particular T, then 

y + T + ATy = y + y 

is always positive for nonzero y. Setting x = Ty and noting that x can take 
any nonzero value, since y is an arbitrary nonzero vector and T is non- 
singular, x + Ax is then seen be to positive for any nonzero x. Thus, A is 
positive-definite if, and only if, there exists a nonsingular matrix T which 
makes T + AT a unit matrix. 

Another necessary and sufficient condition for a self-adjoint matrix A to 
be positive-definite is that the components of A satisfy 

a a fl u a in 

a u > 0, 11 12 >0, ..., * : >0 (5.48) 

a 21 22 

a nl a nn 

The conditions given in (5.48) do not require the knowledge of the eigen- 
values, and hence they provide a practical method for checking the positive- 
definiteness of a self-adjoint matrix. Although the proof is lengthy, it will 
be given below. 

When A is a square matrix of order I, the necessary and sufficient condi- 
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tion for A to be positive-definite is obviously given by a n > 0. It will be 
appropriate, therefore, to attempt a proof for the general case by mathe- 
matical induction. First, let us prove the sufficiency. Let A„_ x be a square 
matrix of order n — 1 with the first n— 1 rows and n — 1 columns of A. 
When the first n -1 conditions in (5.48) are satisfied, assume that A„_ x is 
positive-definite; i.e., there exists a nonsingular matrix T n _ x satisfying 



T«"- 1 A B - 1 T lt _ i =1 



(5.49) 



With this assumption, if A can be proved to be positive-definite using the 
last condition in (5.48), then (5.48) is sufficient. Let us write x and A in the 
forms 



then we have 



= f x » - 1 1 a = A - f An_1 a "“ 1 

’ A ~ A n ~ + 

A n “n - 1 a nn 



'ifc: 'dh;'] 

— x„_ 1 A n _ 1 x n _ j + x„x n _ 1 a„_ j + x n a n _ 1 x n _ 1 + o nn x n x n 



(5.50) 



Let y be a vector defined by means of the relation 



-MTv 



("1 [y n -i] _ 



= L„y = Ly (5.51) 



Since detL = detT„_ x and T rt _ x is nonsingular, L is also nonsingular. With 
this y, (5.50) can be rewritten in the form 

y + L + ALy = (y„_j — T^_,a„_ 1 y (1 ) + t 1 A B . l T„. 1 (y B . 1 -t lVl ^) 

+ y n ( y n _ x - T^a,,^ 

+ y„*a n + _ 1 T„_ 1 (y„_ x - T„ + _ ,a„_ x y n ) + a nn y„*y n 

The right-hand side can be simplified using (5.49) which results in 

y + L + ALy = y^.y^, + (a nn - y«*y„ (5.52) 

This shows that L + AL is a diagonal matrix given by 

L + AL = diag [1 1...1 a nn — a^_ ,T„_ X T^_ ,a n _ ,] (5.53) 

where the first n — 1 diagonal components are all unity. Taking the deter- 
minant of (5.53), we have 

(det L + ) (det A) (det L) = |detL| 2 detA = a nn - a„ + _ jT m _ ,T n + _ ,a„_ t 



(5.54) 
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From the last condition in (5.48), detA is positive; hence, the right-hand 
side of (5.54) must be positive. Again, introducing a new vector z defined 
by 



= 0 fa -a + T°T* a V' 2 ] PH =M ~‘ y ( 5 ' 55 > 

L U \ U nn “n — 1 1 n — 1 ■* n — 1 “n — 1 / /n 



where M 1 is clearly nonsingular, (5.52) becomes 



z + M + L + ALMz = z + z 



(5.56) 



Noting that x is an arbitrary nonzero vector, which means that y and z are 
both arbitrary, (5.56) indicates that 

M + L + ALM = I (5.57) 

If we set LM =T, then since T is nonsingular, (5.57) shows that a non- 
singular matrix T exists which makes T + AT a unit matrix. Therefore, A 
is positive-definite, and the proof of sufficiency is complete. 

Let us next consider the necessity of (5.48). If a n < 0, then x + Ax cannot 
be positive for a particular x + given by [1 0 0...0]. Therefore, o n >0 
is a necessary condition for A to be positive-definite. To apply the principle 
of mathematical induction, let us assume that the first m — 1 conditions in 
(5.48) hold, but the /wth condition does not. Then, from the sufficiency of 
the first m — 1 conditions, there exists a nonsingular matrix T m _, satisfying 

T«- 1 A jii _ 1 T w _ 1 = I (5.58) 

Define a square matrix L„, similar to L„ in (5.51) using T m _ 1 and a m _j in 
place of T n _ l and a n _ l5 respectively. Then, L m is nonsingular, and we have 

L:A m L m = diag[l 1 ... 1 (5.59) 

If det A m < 0, a mm - a+_ I T ni _ 1 T n ^_ 1 a m _ 1 can be proved nonpositive (nega- 
tive or zero) using an expression similar to (5.54). Let x m = L m y m and 
y m + = [0 0... 0 1]. Then, for x + given by 

x + =[xj, 0 0 ... 0] 

where the last n - m components are all zero, we have 

X+ Ax = x„~A m x„, = y+L+A m L m y m = a mm - aJ;_ 1 T m .,T ) ;_ I a ra _ 1 

which is nonpositive. Therefore, the wth conditon is also necessary for the 
positive definiteness. Since m is arbitrary, all the conditions in (5.48) are 
necessary, and the proof is complete. 




If the range of x + Ax for arbitrary nonzero x includes zero and otherwise 
remains positive, A is said to be semipositive-definite. Matrix A is semi- 
positive-definite if, and only if, at least one of the eigenvalues is zero and the 
remainder are not negative. From (5.28), det A is equal to zero in this case, 
and A -1 does not exist. 

Now, suppose that there are two self-adjoint matrices, A and B, and A is 
positive-definite, then from the positive-definiteness of A, there is a non- 
singular matrix T which satisfies T AT = 1. Since 



(T + BT) + = T + BT = T + BT 



then T + BT is self-adjoint. Let x l5 x 2 ..., x n be the normalized eigenvectors of 
T + BT, and let X be a square matrix constructed from them, i.e., X = [xj 
X 2 -” X J. Then, X + T + BTX must be diag[2 t X 2 ...X„~] where 2,0 = 1, 2,..., n) 
is the eigenvalue corresponding to x ? . On the other hand, we obviously have 

X + T + ATX = XIX = X + X = I 



therefore, if we define H by H = TX, H is nonsingular and 

H + AH = I (5.60) 

H + BH = diag [2, X 2 ... 2„] (5.61) 

The above discussion shows that if there are two self-adjoint matrices A 
and B, and one of them, say A, is positive-definite, then there is a non- 
singular matrix H which diagonalizes both H + AH and H + BH, simul- 
taneously; i.e., all of their components except those on the main diagonal 
are made zero. This fact will be used in the noise discussion of linear ampli- 
fiers in Chapter 7. 

In the above discussion, H + is not necessarily equal to H“\ From (5.60), 
we have 

H + = (AH) - 1 =H" ! A _1 
Substituting this into (5.61), we obtain 

H- , A" 1 BH = diag[2 1 X 2 ... 2„] 

The similarity transformation of A ! B by H _1 gives the eigenvalues 
2i, X 2 ,..., 2„. However, since similarity transformations do not change the 
eigenvalues of a matrix, as explained in connection with (5.24) and (5.25), 
Xi,X 2 ,...,X„ must also be eigenvalues of A _1 B. Consequently, X u X 2 ,..., X„ 
in (5.61) can be obtained by calculating the eigenvalues of A _, B. Both A 
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and B are self-adjoint, but A -1 B is not necessarily so. Nevertheless, its 
eigenvalues are all real because \ Xt X 2 ,..., X„ are the eigenvalues of a self- 
adjoint matrix T + BT. 

Finally, let us consider a necessary and sufficient condition for two different 
self-adjoint matrices A and B to be simultaneously diagonalized by a unitary 
matrix through the similarity transformation. Although the discussion is 
instructive and worth presenting, we shall not utilize the result in this book, 
and one may omit the remainder of this section on a first reading and 
proceed to Section 5.2. 

Suppose that the similarity transformations of two different self-adjoint 
matrices A and B by a unitary matrix give diagonal matrices A' and B', 
respectively. The order of the product of two diagonal matrices can be 
interchanged without changing the result; i.e., 

A B = BA 

The forms of matrix equations are invariant to similarity transformations, 
and hence we have 

AB = BA (5.62) "} 

The above discussion shows that (5.62) is a necessary condition for A and B j 

to be simultaneously diagonalized by a unitary matrix through the similarity £ 

transformation. The proof that (5.62) is also sufficient will be given below. j 

We first assume that (5.62) holds, and prove that there is a unitary matrix 
which* diagonalizes both A and B through the similarity transformation. 

Let x l5 x 2 , x. be the normalized eigenvectors of A, and let X be a matrix f 

% 

constructed from them; i.e., ■?. 

■ | 

X = [Xi x 2 ... x„] | 

Then, we have | 

X _1 AX = X + AX = diag[A 1 X 2 ... X„] (5.63) | 

.1 

where X t is the eigenvalue corresponding to X;. For convenience, we assume $ 

that X x < X 2 <A 3 < ••• < A„. In order to calculate X -1 BX, let us consider 
x, + ABx ; . From (5.62), x+ ABx,- is equal to x+BAx,. From the fact that 
both A and B are self-adjoint, we have > 

x, f ABxy = (A x ( -) + Bx ; = 2,-Xf‘BXj- = x^BAx, = x^B^Xj = A J x i + Bx J - 

It follows from this that, if A, ^Xj , I 

X i + Bxj = 0 f 
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In other words, X + BX must have the form 

Bi 0 0 

0 B 2 0 
0 0 B 3 

where B* (/ = 1, 2,...) is a square matrix with the number of columns (and 
rows) equal to the degree of the degeneracy of the /th smallest eigenvalue of 
A, where the /th smallest eigenvalue is such that there are / - 1 discrete 
eigenvalues smaller than it. The largest subscript is equal to the number of 
different eigenvalues of A. Since (X + BX) + = X + B + X = X + BX, we have 

Bj + = B, 

which shows that the B/s are also self-adjoint. Let Y, be a unitary matrix 
constructed from the normalized eigenvectors of B, in an usual manner. 
Then the similarity transformation of B, by Y, + gives a diagonal matrix. 
Let Y be a square matrix defined by 






Then Y itself becomes a unitary matrix. From the method of constructing Y, 
it is obvious that Y + X + BXY is a diagonal matrix and that Y + X + AXY 
becomes diag [X x X 2 ...X n ] from (5.63). Noting that XY is a product of 
two unitary matrices, it must be a unitary matrix, and therefore we see that 
(XY) 1 =Y + X + is a unitary matrix. Thus, both A and B are shown to be 
diagonalized through the similarity transformations by the unitary matrix 
Y + X + . This completes the proof. 



5.2 Reciprocal Conditions 

Suppose that several waveguides are connected together to form a wave- 
guide junction as schematically shown in Fig. 5.1. At a reference plane far 
from the discontinuities in each waveguide, all the higher modes with fre- 
quencies below cutoff can be neglected, and only a finite number of prop- 
agating modes have to be considered. At such a reference plane, the voltage 
and current can be defined for each propagating mode as we explained in 
Chapter 3. Since Maxwell’s equations indicate a linear relation between the 
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Fig. 5.1 . Waveguide junction. 



electric and magnetic fields, there should be a linear relation between the 
voltages and currents which represent the fields. Let us consider the case 
in which all currents are given independently, and the voltages are uniquely 
determined by them. In such a case, we have 



= Zu/j + z 12 / 2 + ••• + Z ln I„ 

^2 = %2lh + Z.22^2 d b Z 2 Jn ^ 

K - Z nll l + 7' nil 2 H b Z„J H 

where K,- and /, (/ = 1, 2, n) are the voltage and current for a propagating 
mode at the reference plane in a waveguide shown in Fig. 5.1. When two 
or more propagating modes exist in some of the waveguides, n becomes 
larger than the number of actual openings of the junction. Since we are 
interested in the behavior of the waveguide junction as viewed from outside 
the reference planes, from (5.64) the junction as a whole can be considered 
as an «-port network. We can now use the concept of matrices explained 
in the previous section and define v, i, and Z by 
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respectively. Then (5.64) can be written in a particularly simple form, 

v = Zi (5.66) 

Similarly, if the voltages can be given independently of each other, and 
the currents are uniquely determined by them, we have 

i = Yv (5.67) 

If Z is nonsingular, we have from (5.66) and (5.67) 

Z -1 =Y (5.68) 

In order to study relations between components of the matrix Z, let E (1) 
and H (1) be the electric and magnetic fields inside the junction corre- 
sponding to a set of currents given at the ports. Similarly, let E (2) and H (2) 
be the fields corresponding to another set of currents. Using Maxwell’s 
equations 

VxH = (d+ jws) E , V x E = - ycu/iH 

we have 

V-(E (1) x H (2) -E (2) x H (1) ) 

= H (2) V x E (1> — E (1) • V x H <2) - H (1) * V x E (2) + E <2) -V x H (,) 
= - jcoH {2) ./jH (,) - E (1> .(a + jax) E (2) 

+ j(o H (1) - i uH (2) + E (2) • (a + jm) E (l) (5.69) 

Since (a +y'a>£) and p are ordinary scalar quantities and the order of a scalar 
product of vectors is interchangeable, the terms on the right-hand side 
cancel. If we integrate (5.69) over the volume of the waveguide junction 
enclosed by S shown in Fig. 5.1, we obtain 

J s ( E (1) x H (2) -E (2> x H (1) ) -n dS = 0 (5.70) 

where use is made of Gauss’s theorem to convert the volume integral to a 
surface integral. The contribution to the integral comes from that part of S 
consisting of the reference planes in the waveguides only. Let us now assume 
that E (l) and H U) correspond to a set of currents which are all zero except 
// 1} at the /th port. Similarly, let E (2) andH (2) correspond to a set of currents 
which are zero except /j 2) at the y'th port. For the surface integral in (5.70), 
we then only have to consider the contributions from the /th and jth ports. 
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The integral of the first term in the bracket becomes 



J E (,) x H <2) n dS = J E tJ Vj l) x k x E (J /j 2) -n dS 
= Fj n /j 2 ) k*n = - kj 1) /j 2) 

where E tj is the waveguide eigenfunction at the jth port, and the normal- 
ization condition for E (J - is used. The integral of the second term can be 
calculated in a similar manner, and (5.70) reduces to 

- Vj' ) I i J 2) + V i (2, i f (,) = 0 (5.71) 

in this particular case. Since the /’s with superscript two are all zero except 
Ij 2) , Vj- 2) can be expressed by Z ij /j 2) from (5.64). Similarly, Vj is given by 
Z }i l\ l) . Substituting into (5.71), we have 

(Zy — Zji) = 0 

Noting that 7 { (1) and lj 2) can be specified arbitrarily, we must conclude 



or equivalently, 



Z,7 = Z/i 



z = z, 



(5.72) 



The expression (5.72) is known as the reciprocal condition in ac circuit 
theory. The «-port network representing the waveguide junction is, therefore, 
reciprocal. 

In much the same way, given the voltages instead of the currents at the 
ports, we obtain 

Y = Y, (5.73) 

Of course, if Z is nonsingular, (5.73) also follows from (5.68) and (5.72). 

In the above discussion, we considered reference planes far from wave- 
guide discontinuities; however, (5.64) has broader applications. For example, 
when there are several pairs of terminals inside the waveguide junction, 
the terminals in each pair being closely spaced compared to a wavelength 
[A = 27r/co(£^) 1/2 ], then the voltage between the terminals and the current 
flowing through them can be defined with little ambiguity. This follows 
because the electric field satisfies V x V x E — (o 2 ef.iE = 0, assuming that a 
is negligible, and that the first term on the left-hand side is made up of 
second derivatives which are individually much greater than the second 
term in the vicinity of the pair of terminals where field variation is large. 
As a result, the field should approximately satisfy V x V x E = 0 which 




is an equation for the static field. The voltage can be defined as the line 
integral of the electric field if the field resembles the static one. Some of the 
voltages and corresponding currents in (5.64) can, therefore, be considered 
to represent the conventional voltages and currents at the pairs of terminals. 
The linear relations and the reciprocal conditions (5.72) and (5.73) still hold 
due to the particular form of Maxwell’s equations which have been used. 
However, a slight modification is required in the evaluation of the surface 
integral leading to (5.71). Let the yth port be the one representing such a 
pair of terminals, and let Sj be the closed surface enclosing a small volume 
in the vicinity of the pair of terminals as shown in Fig. 5.2. In order to evaluate 
the surface integral, we choose the elementary surface 

n dS = — e d£ x h drj 




Fig. 5.2. A pair of terminals in junction and Sj. 



where e and h are unit vectors in the direction of E (1> and perpendicular to 
it, respectively. Then we have 



f E (1) x H (2) *n dS = — f E (1) x H ( 2 ) .ed£ x h dt] 

J S J Sj 

= -JJ E ( 1 ) . e ^H ( 2 ) -h^ = — Vj'Wj 



from which (5.71) follows. 

From the above discussion, we see that if a waveguide junction includes a 
small nonlinear device such as a semiconductor diode, then the junction 
can be treated as an /?-port linear network with the device connected to one 
of the ports. This enables us to handle the nonlinearity of the device separately 
from the remainder of the microwave circuit. The /i-port network in this 
case represents the junction which excludes a small volume occupied by the 
active nonlinear part of the device. 
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Now let us consider a linear transformation of the voltage and current 
given by 

fl, = i|R e Z ( | ,/2 (K + Z,/,) ( b, = $ |ReZ,| 1,2 (K, — Z,*/,) (5.74) 

where R eZ, is assumed to have a nonzero value. If the reference impedance 
Z; is the impedance looking out from the ith port of the junction, and 6, 
become the power waves introduced in Section 1 .4. If Z f is real and positive, 
and represents the characteristic impedance of the waveguide mode to 
which the ith port is assigned, then a t and b t are the traveling waves at the 
reference plane. Let a and b be the vectors with the ith component being 
a t and b„ respectively. If we define F and G by 

F = diag IReZJ “ 1/2 4|ReZ 2 r«' 2 ... }|ReZ„r*' 2 ] 

G = diag[Z, Z 2 ... ZJ (5.75) 

we have 

a=F(v + Gi), b = F(v — G + i) (5.76) 

Since v and i are related linearly, there must be a linear relation between a 
and b; let us write it in the form 

b = Sa (5.77) 

where S is a square matrix. This S is called the scattering matrix. Substituting 
(5.76) into (5.77), we have 

F(v — G + i) = SF(v + Gi) 

Using v = Zi, this reduces to 

F(Z — G + )i = SF(Z + G) i 

from which S can be obtained in terms of Z, 

S = F(Z— G + )(Z + G)~’ F~ l (5.78) 

Similarly, Z can be expressed in terms of S, 

Z= F" , (I-Sr 1 (SG + G + )F (5.79) 

The reciprocal condition of the junction in terms of S is given by 
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We shall now derive this formula from Z = Z,. First, we note that P is a 
diagonal matrix whose ith diagonal component 



ReZ, 
Pi = i Re ZT| 



(5.82) 



is + 1 or — 1 depending on whether the real part of Z x is positive or negative, 
respectively. Therefore, P is nonsingular, and P = P“ 1 . Substituting (5.79) 
into Z = Z„ we have 

F" 1 (I - S)- 1 (SG + G + ) F = F(GS, + G + ) (I - S t )~ 1 F _1 
This is equivalent to 

(SG + G + ) F 2 (I - S t ) = (I - S) F 2 (GS, + G + ) 

Expanding this and subtracting G + F 2 - SGF 2 S, from both sides, we have 



SGF 2 - G + F 2 S f = F 2 GS, - SF 2 G + 

or equivalently, 

SF 2 (G + G + ) = F 2 (G + G + ) s, 

However, since 

F 2 (G + G + ) = iP = iP _1 



the above equation gives (5.80), which is what we wished to derive. 
When the signs of ReZ,. and ReZ ; are the same, (5.80) gives 

Sij = Sji 

and when they are opposite, it gives 



Su = -Sj t 

In either case, we have 

|S,-/ = |S 2 / (5.83) 

Suppose that the a? s and the b’s represent power waves and none of the 
circuits contains a source, except the one connected to the ith port of the 
network. The power from the yth circuit to the network is given by 
Pj(\aj \ 2 -\bj\ 2 ) and aj(j^i) is equal to zero in this case. Therefore, the 
power to theyth circuit from the network is given by Pj\bj\ 2 . Furthermore, 
b j is equal to Sjfii in the present case, and hence the ratio of the net power 
Pj\bj \ 2 into the /th circuit to the exchangeable power from the ith 

circuit is equal to p ( p j\S . Since (5.83) indicates that the value of this ratio 
does not change when the subscripts i and j are interchanged, we can 
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conclude that the relation between the net power into a load and the 
exchangeable power from the source remains the same in a reciprocal 
network when the roles of source and load are interchanged. This is a power 
reciprocal relation. Note in the above interchange of generator and load 
relation, only the location of the voltage source is changed while the 
impedances connected to the ports are kept constant. 

When the a ? s and b;'s express traveling waves along the waveguides, 
(5.80) reduces to 

S = S t 

since P = I in this case. Although the relation (5.83) remains the same, the 
interpretation is different. Assume that the ith port is the only one with a 
nonzero incoming wave, then |Sj,| 2 is the ratio of the power emerging from 
they'th port to the incoming power at the ith port. This ratio remains con- 
stant when the roles of the ith and jth ports are interchanged. However, if 
the circuit connected to some port, say the kth port, is not matched to the 
characteristic impedance of the line, a reflection takes place resulting in an 
incoming wave, a k which invalidates the assumption. Therefore, the power 
reciprocal relation \S U \ 2 = isy 2 for traveling waves has only limited 
applications. 

5.3 Lossless Conditions 

When losses in a waveguide junction are small, it is convenient to consider 
a model in which the effect of losses is completely neglected. Let us consider 
the lossless conditions which Z, Y, and S must satisfy when these represent 
such an idealized model. Since the net power entering from the ith port is 
given by RcVjlf, the total power entering from all ports becomes 

X *e(*v*) = I HVih* + V*h) = *(I + T + v + i) = ±i + (Z + Z + ) i (5.84) 

where the summations are over all possible i*s and use is made of (5.66). 
For the lossless network, the total power must be zero regardless of the value 
of i. Thus, we obtain 

Z + +Z = 0 (5.85) 

It is obvious from the derivation that this is a necessary and sufficient con- 
dition for Z to express a lossless network. For one-port networks, (5.85) 
becomes ReZ = 0, which shows that the real part of the input impedance 
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is equal to zero. If we use (5.67) instead of (5.66), we obtain from (5.84) 

Y + Y + = 0 (5.86) 

This is also necessary and sufficient for Y to represent a lossless network. 

From (5.84), we find that Z tJ = - Z* for a lossless network. Since Z j{ = Z i} 
for a reciprocal network, it follows that Z 0 - = - Z* for a lossless reciprocal 
network. This means that all the Z ( -/s are pure imaginary in this case. 
Similarly, all the Y f /s are pure imaginary for a lossless reciprocal network. 

For a general passive network, since the total power expressed by (5.84) 
must not be negative, Z + Z + and Y + Y + have to be either positive-definite 
or semipositive-definite. 

Next, let us consider the lossless condition in terms of S. The net power 
into the ith port of a network is given by pi{\a-\ 2 - |6,-| 2 ). Therefore, the total 
power is given by 

I Pi{Wi \ 2 - \bi\ 2 ) = a + Pa - b + Pb = a + Pa - a + S + PSa = a + (P - S + PS) a 

where use is made of (5.77). This must be zero regardless of the value of a, 
and hence the lossless condition is given by 

S + PS = P (5.87) 

When the ReZ f ’s are all positive, P = I, and (5.87) shows that S must be 
an unitary matrix. 

Multiplying (5.87) by P from the right and utilizing (5.80), we find that 
S + S f = I for a lossless reciprocal network. Taking the transposed matrix, 
this becomes 

SS* = 1 

For a general passive network, since the total input power must not be 
negative, P — S + PS has to be either positive-definite or semipositive- 
definite. 

For a simple example, let us consider a lossless two-port network. Equa- 
tion (5.87) gives three independent conditions 

Pi l^i il 2 + Pz i^ 2 i | 2 = Pi 

PiS u S* 2 + P 2 S 2I S* 2 = 0 (5.88) 

Pi 1 ^ 12 ! + Pz 1 ^ 22 ! 2 = Pz 
From the second condition, we have 



|S„I 2 |S 12 | 2 = |S 21 | 2 |S 22 | 2 
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Combining the first and last conditions in (5.88) with this equation, we 
obtain 

foMO - IS22I 2 ) (Sul 2 = (P1/P2X 1 - ISul 2 ) |S 22 | 2 
which is equivalent to 

|5 11 | 2 = |S 22 | 2 

When S represents the power wave scattering matrix, this relation shows 
that the power reflection coefficient at one port is equal to that at the other 
port for a lossless two-port network. From this we conclude that the power 
reflection coefficient as well as the power transmission coefficient remains 
the same regardless of the position of the reference plane selected along a 
lossless transmission system. The fact that the exchangeable power is 
preserved during a nonsingular lossless transformation can also be easily 
shown using (5.88). Assume that a 2 is zero for the moment, then the 
exchangeable power from port 2 is given by 

P2 |fr 2 l _ Pi\^2i \ 2 \ a i \ 2 _ . |2 

1 - |S 2 J 2 “ ”l -ISnl 2 " ' fll1 

where we have used the first condition in (5.88). The right-hand side is 
exactly equal to the exchangeable power from the source connected to port 1. 
We have, therefore, shown that the exchangeable powers are the same at 
the input and output ports of a lossless two-port network, provided that 
I'S'nl 2 = |S 22 I 2 ^ 1- If 1‘S’iil 2 = l‘S' 22 | 2 = 1, the input and output ports are 
effectively disconnected inside the network, and the transformation from 
the input port to the output port is said to be singular. 

5.4 Frequency Characteristic of Lossless Reciprocal Junctions 

Let us first consider Z and Y representing a lossless reciprocal junction. 
Maxwell’s equations are given by 

VxE = - jcopH (5.89) 

V x H = j(osE (5.90) 

Differentiating with respect to co, we have 




V x (<3E/oa>) = — jpH — jcop (dHjdco) 

V x (dH/dco) = je E + jcoe (dEjdco) 



(5.91) 

(5.92) 
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Subtracting (dEjdco) • times the complex conjugate of (5.90) from H* ♦ (5.91), 
we have 

V • {{dEjdco) x H*} = - jp H* • H - jcop(dHjdco) • H* + jcoe E* • (dE/dco) 

(5.93) 

Similarly, we have from (5.89) and (5.92) 

V • {(dH/dco) x E*} = je E* - E + jcoe (dEjdco) • E* - jcopH* • (dHjdco) 

(5.94) 

Subtracting (5.94) from (5.93), we obtain 

V • {(dEjdco) x H* - (dH/dco) x E*} = - j (pH* • H + eE* • E) (5.95) 

Let us integrate (5.95) throughout the volume within 5 in Fig. 5.1. The 
left integral becomes a surface integral over the waveguide cross section 
only, and is given by 

J {(dEjdco) x H* - (dHjdco) x E*} *n dS 

= I J i^tiiSVJdco) x (k x E ti ) I* - (k x E ti ) (dljdca) x E„F ; *} -n dS 

= - Z {(Wildoj) 1* + (dljdco) V*} = — {i + (dv/dco) + v + (di/dco)} 

where E fi indicates the normalized eigenfunction of a waveguide propagating 
mode. The volume integral of (5.95), therefore gives 

(i + (d\jdco) + v + (dijdco)} =j J (pH* • H + eE* • E) dv (5.96) 
Using v = Zi, the left-hand side is rewritten in the form 



{i + (dZjdco) i + i + Z (dijdco) + i + Z + (di/dco)} 

= {i + (dZjdco) i + i + (Z + Z + ) (dijdco)} 



However, the second term on the right-hand side vanishes because of 
(5.85), and (5.96) therefore reduces to 



i + (dZjdco) i = j J (pH* • H + sE* * E) dv (5.97) 



s 

( 

i 

i 



For a lossless reciprocal junction, all the components of Z are pure imaginary, 
and if we write Z =jX , the components of X will be real. Substituting this 
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into (5.97), we have 

i + (dX/dco) i = J (pH* • H + eE* • E) dv (5.98) 

This shows that dX/dco is positive definite, since the right-hand side is always 
positive for nonzero fields. If we use i=Yv instead of v = Zi, we obtain 
from (5.96) 

v + (dB Idea) v = J (pH* • H + eE* • E) dv (5.99) 

where Y = jB is used. This shows that dB/dco is also positive definite. 

For one-port networks, the above relations indicate that the derivatives 
with respect to co of the input reactance X, and susceptance B, are always 
positive; i.e., 

(dX/dco) > 0, (dB/dco) > 0 

which correspond to Foster’s reactance theorem. As illustrated in Fig. 5.3, 
X and B increase with co almost everywhere. 




Fig. 5.3. Frequency characteristic of pure reactance X or pure susceptance B. 



\ 




Next, let us consider the relation for S corresponding to (5.98) or (5.99). 

To do so, we express v and i in terms of a and b and substitute into (5.96). 

We have from (5.76) I 

a — b = F(G + G + )i | 

Multiplying by 2FP from the left and using 2FPF(G + G + ) = I, we obtain 

i = 2FP(a — b) (5.100) J 
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A combination of (5.76) and (5.100) gives 

a + b = 2Fv + F(G - G + ) i = 2Fv + 2FPF(G - G ) (a - b) 

from which v can be obtained in terms of a and b. If use is made of (2F)~* = 
PF(G+ G '), the expression for v is simplified as follows: 

v = (2F) -1 (a + b) - PF(G - G + ) (a - b) 

= PF((G + G + ) (a + b) - (G - G + ) (a - b)} 

= 2PF(G + a + Gb) (5.101) 

Now, let us substitute (5.100) and (5.101) in the left-hand side of (5.96). 
After a little manipulation, we have 

(i + (folded) + v + (di/dco)} = 4 [a + (F + (dF G + /dco) + GF + (dF/dco)} a 

+ a + (F + FG + + GF + F) (dtt/dco) 

- b + (F + (dFG/dco) + G + F + (dF/dco)} b 

- b + (F + FG + G + F + F)(ab/am)] (5.102) 

The second and fourth terms on the right-hand side can be simplified using 
the relations, 

F + FG + +GF + F = iP, F + FG + G + F + F = ip (5.103) 

The expressions within the brackets of the first and third terms on the right- 
hand side of (5.102) are more complicated. Let us calculate the /th diagonal 
component of the first one. Since the /th diagonal component of dF/dco is given 
by 

i (5 1 Re 2, -| ‘ 1/2 /dco) = - i | Re Z,\~ 1 12 \ Re Z\~ 1 (d | Re Z^/dco) 

= — i|ReZ,r 1 / 2 \Z* + zr l m*+Z t \lda) 

the /th diagonal component of 

{F + (dF G + /dco) + GF + (dF/dco)} = F(G + + G) (dF/dco) + F + F(dG + /dco) 
becomes 

- i | Re Z,r 1 {d (Z* + ZJ/Ab) + i | Re Z,r 1 («,*/&)) 

= ilz,* + z,r 1 {a(z j *-z,)/a<o} 

Let K be a diagonal matrix whose /th component is given by 

\z i *+z i \- 1 {d(z i *-z i )/dw} 
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then we have 

{F + (dF G + /dco) + GF + (dF jdco)} = *K (5.104) 

Noting that K is pure imaginary, the expression within the brackets of the 
third term on the right-hand side of (5.102) becomes 

(F + (dF G/dco) + G + F + (dF/dco)} = ±K + = - (5.105) 

Substituting (5.103), (5.104), and (5.105) into (5.102), we have 

(i + (d\/dco) + v + (dilda))} = a + Ka + b + Kb + 2 {a + P(da/do>) - b + P(db/doj)} 

(5.106) 

Using b = Sa, the second term on the right-hand side becomes a + S + KSa. 
Furthermore, the expression within the bracket of the last term can be 
rewritten in the form 

(a + P(da/d«) - b + P(db/dco)} = a + P(da /do) - a + S + P(5S a /dco) 

= a + P(da/0w) - a + S + P(fiS/da)) a (5.107) 
- a + S + PS(3a/do>) 

The first and the last terms cancel each other because of the lossless con- 
dition (5.87), and only the second term remains. Consequently, (5.96) 
becomes 

a + (K + S + KS - 2S + P(dS/dco)} a = j J (pH* • H + eE* • E) dv 

Dividing both sides by j, and using — K = K + , we obtain 

a + j{K + + S + K + S + 2S + P(dS Idea)} a = J (pH *-H + eE*-E)dv 

(5.108) 

This shows that j{K + + S + K + S + 2S + P (dSjdco)} is positive-definite. 

When all the Z- s are real, K = 0, and (5.108) reduces to 

a + 2yS + P(<3S/dco) a = J (/iH* ♦ H + eE* • E) dv (5.109) 

When the Z- s express the real characteristic impedances of the waveguide 
propagating modes, P = I, and we have 

a + 2/'S + (5S Ido)) a = J (pH* H + sE* • E) dv 

In order to see the physical meaning of (5.108), let us consider the simplest 
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case where a lossless reciprocal one-port network is connected to a generator 
having an internal impedance with a positive real part. Let the incident 
wave be the addition of two waves having equal amplitudes at frequencies 
co and co + Aco 

a(co) = Ae J<0 ' + Ae i(a,+4 "* 

Taking the real part and multiplying by ^2, the actual wave as a function 
of time is given by 

a (t) = yjlA cos cot + yj2A cos(cu -I- Aco) t 
= 2y]2A cos(i Aco t) cos(co + \ Aco) 1 

The slowly varying function 2^/2 A cos (%Aco /) expresses the envelope of 
the waveform. 

We shall now consider the reflected wave. Since the magnitude of S lt 
is unity, from the lossless condition, S n can be written in the form 

5 n =e -MO (5no) 

If we write <p(co + Aco) = cp + Acp, the reflected waves becomes 

6(o>) = yle ,<£0t-ip) + + Jw)t-(<p + J<p) 

This represents an actual wave as a function of time which is given by 
b(t) = 2 yj2A cos {\ (Aco t — Acp)} cos {(gj + $ Aco) t - cp) 

The envelope of b(t) lags behind that of a(t) by Acp/ Aco. If no energy is to 
be transferred through the nodal points of the envelope, as we discussed in 
Section 3.2, the above result indicates that the energy is reflected back at a 
time Acp/Aco after its entrance into the network. Let r be the limiting value 
of Acp/Aco when Aco approaches zero, then r becomes the time delay required 
for incident energy with angular frequency co to enter the network and leave 
it again. Strictly speaking, (2nn + Acp)/ Aco has to be considered as the time 
delay, where n is an integer since it is not certain which nodal points in the 
incident and reflected waves correspond to each other. However, if n ± 0, 
(2nn + Acp)/ Aco becomes infinite as Aco approaches zero, and it cannot be 
considered as an appropriate value for the time delay. Therefore, n was 
chosen to be zero in the above discussion. From (5.109) and (5.110), x is 
given by 

r = (dcpjdco) (dS ll /dco) = ±(a*a)~ l f (/^H* ♦ H + £ E* -E) dv 

J (5.111) 
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The time delay is equal to the total stored energy in the network divided 
by the incident power. The larger the stored energy per unit incident power, 
the longer the delay. 



5.5 Lossless Reciprocal Three-port Networks 

To illustrate the utilization of some of the results obtained for the 
impedance matrix in Sections 5.2 and 5.3, let us derive an equivalent circuit 
for lossless, reciprocal, three-port networks. In general, a three-port network 
is characterized by a matrix of order 3x3, and is therefore determined by 
nine complex numbers. However, since Z tJ = Z n for reciprocal networks, 
only six complex numbers, or equivalently, twelve real numbers are sufficient 
to determine a reciprocal three-port network; furthermore, all the Z f /s 
are pure imaginary for lossless networks. Consequently, a lossless reciprocal 
three-port network is specified by only six real numbers. Let us choose these 
six real numbers to be X lu X l2 ,X l3 , X 22 ,X 23 > andX 33 where X u = -jZ u . 

For the moment, we shall assume that X 12 , X 13 , andZ 23 do not vanish. 
Suppose that the transmission line connected to port III is open-circuited 
at a reference plane III' and the impedance looking out from port III 
becomes jX 3 . Then, V 3 is given by - jX 3 I 3 . Substituting this into the 
simultaneous equations represented by v = Zi, we can eliminate V 3 and 
I 3 . The result is given by 

fLii.r {^ 1 i-Ar 1 8 3 (^3 + Ar 33 )- 1 } {x 12 -* 13 a' 23 (*3+*33)- 1 }] rM 

LT2J * \-{Xi 2 — X\zX2z(,X$ + X 33 ) -1 } {-^22 — ^23 (^3 + ^ 33 ) _1 } J I/2J 

If we shift the opcn-circuited position III' along the transmission line, X 3 
changes from —go to + 00 ; therefore, by choosing a proper position, we 
can make the off-diagonal components vanish, i.e., 

{*12 — *i 3 ^2 3 (*3 + * 33 ) 1 } = 0 (5.112) 

In other words, the coupling between ports I and II disappears when the 
transmission line is open-circuited at the proper position III'. If we look at 
the junction from ports I, II, and the new reference plane III', the impedance 
matrix must have the form 
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An equivalent circuit for this matrix is given by Fig. 5.4(a) where X' + X” = 
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(a) x' + x"=x 33 




(b) 




(c) 

Fig. 5.4. Construction of an equivalent circuit for a lossless reciprocal three-port 
network. 



X 33 . Since each of the two-port networks is lossless and reciprocal, they can 
each be represented by a series reactance, a transformer, and a certain length 
of transmission line, as shown from the discussion given in Section 1.3. The 
length of the line can always be chosen to be positive by adding an integer 
multiple of a wavelength. The equivalent circuit Fig. 5.4(a) can then be 
simplified to the form shown in Fig. 5.4(b) where the two series reactances 
are combined to give a single reactance X. From this discussion, an equivalent 
circuit for the junction looking in from the original ports I, II, and III can 
be represented by Fig. 5.4(c), where the line length attached to port III is 
again made positive by adding an integer multiple of a wavelength. In the 
equivalent circuit, the six real numbers characterizing the junction are 
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If either X 23 or X i3 is equal to zero, (5.112) may not be satisfied by any 
X 3 . In this case, we can use port I or port II in place of port III, in the above 
discussion, and the same type of equivalent circuit will be obtained. In 
case, any two of X 12 ,*i 3 , and X 23 are eqaul to zero, say X i2 and X l3 , 
port I is completely isolated from ports II and III. The three-port network 
actually consists of a two-port network and an independent one-port 
network. Setting n 3 = 0, Fig. 5.4(c) can still serve as an equivalent circuit 
for the junction. Finally, if X xl ,X l3 , and X 23 are all equal to zero, the 
junction consists of three independent one-port networks. 




i : o', n' 2 ; i 



Fig. 5.5. Another equivalent circuit for a lossless reciprocal three-port network. 

If we use the admittance matrix instead of the impedance matrix, an 
equivalent circuit, shown in Fig. 5.5, will be obtained, which is dual to 
Fig. 5.4(c). 

5.6 Symmetrical Y Junctions 

We shall now discuss some important properties of symmetrical Y 
junctions to familiarize ourselves with scattering matrices, as well as to 
prepare for Section 5.8 in which Y circulators will be studied. The circuit we 
shall discuss is a symmetrical junction with three identical transmission 
lines attached. These lines can be in the form of waveguides, coaxial lines, 
or any other type of transmission line supporting only one propagating 
mode. Let us assume that reference planes I, II, and III are symmetrically 
located with respect to the center of the junction, as shown in Fig. 5.6, and 
the reference impedances X i (/ = 1 , 2, 3) are all equal to each other. Since 
the reflection from a port corresponding to a unit incident wave must be the 
same from the symmetry regardless of the port number, we must have 
S lt — S 22 =S 33 . Similarly, the outgoing wave from port I, corresponding 
to a unit wave incident on port III, must be equal to the outgoing waves 





from port III or port II corresponding to a unit wave incident on port II or 
port I, respectively, and we have S 13 = S 32 = S 2l . In much the same way, 
we have another relation S l2 =S 23 = S 3l . However, S l3 may not be equal 
to S 31 if no reciprocal condition is assumed. The most general form of the 
scattering matrix for the symmetrical Y junction is, therefore, given by 




(5.113) 



If identical incident waves are applied to all three ports simultaneously, the 
outgoing waves from all the ports are expected to be identical to each other 
from the symmetry. In other words, we expect that 




is an eigenvector for S, where yjl is a normalizing factor. This can be verified 
by calculating Sx x as follows. 



1 

7 ^ 





— (Sn -l- S l2 + S J3 ) 



1 

7 ^ 




Thus, Xj is, indeed, an eigenvector of S, and the corresponding eigenvalue 
Aj is given by S u + S l2 + 5'i 3 . Similarly, if the incident waves have relative 
phase angles of 0°, 120°, and 240°, the outgoing waves will be expected to 
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have similar phase relations. Therefore, 




are expected to be eigenvectors of S, where 

a — (27r/3) 

Since Sx 2 and Sx 3 are calculated to be 

Sx 2 = (S, , + S 12 ^“ + S, x 2 , Sx 3 =(S n + S 12 e J2a + S 13 e Ja ) x 3 

x 2 and x 3 are, indeed, eigenvectors, and the corresponding eigenvalues are 
given by X 2 = S n +S i2 e ja + S l3 e j2a , and X 3 =S n + S 12 e j2a + S 13 e ja , res- 
pectively. 

Let us consider the electric field along the axis of symmetry as the center 
of the junction. We choose a coordinate system with the z axis coinciding 
with the axis of symmetry, the y axis in the direction of the wave incident 
on port I, and the x axis perpendicular to the y and z axes as shown in Fig. 
5.7. The field corresponding to x x is given by the superposition of three 
cases : 

(i) Port I has an incident wave 1/^/3 and the remainder have none; 

(ii) Similarly, port II has 1/^/3 and the remainder none; 

(iii) Port III has 1/^/3 and the remainder none. 

The axial component of the electric field in each case must be the same from 
the symmetry. Let us indicate it by E, then we have 3 E at the center corre- 
sponding to x 1 . On the other hand, we have to superpose the following 
three cases for x 2 ; 




Fig. 5.7. Top view of symmetrical Y junction and the coordinate axes. 





(i) Port I has an incident wave 1/^/3 and the remainder have none; 

(ii) Port II has e ia j ^/3 and the remainder none; 

(iii) Port III has e i2a jyj 3 and the remainder none. 

The axial components of electric field at the center are given by E for (i), 
Ee ja for (ii) and Ee j2a for (iii). Superposing these three fields, we have 

E( 1 + e ja + e j2a ) = 0 

which indicates that no axial component exists for x 2 . Similarly, the axial 
component corresponding to x 3 is equal to zero. 

Suppose that a thin conductor is introduced along the z-axis of the 
junction without destroying the symmetry; then the electromagnetic field 
corresponding to x x is affected, but those corresponding to x 2 and x 3 are not. 
This .conclusion follows from the fact that the thin conductor forces the 
electric field along it to vanish ; whereas the axial components for x 2 and x 3 
do not exist. Consequently, the eigenvalue X x can be changed without 
affecting the eigenvalues X 2 and X 3 by the introduction of a thin conductor 
along the axis of symmetry. 

Let us next consider the magnetic field perpendicular to the axis of 
symmetry. Let H x be the x component of the magnetic field when an incident 
wave of magnitude 1A/3 is applied to port I; then the x components 
corresponding to waves 1/^/3 incident to port II and port III are given by 
H x cos a and //, cos 2 a, respectively. The corresponding y components are 0, 
H x sin a and H x sin 2a. Suppose that the incident wave expressed by x t is 
applied to the junction, then the x and y components of the magnetic field 
are obtained by the superposition of the three cases as follows: 

H x = H x (1 + cos a + cos 2a) = 0, H y = H x (0 + sin a -f- sin 2a) = 0 

When x 2 is applied, the same components are given by 

H x = H t (1 + e ja cos a + e 2ja cos 2 a) = 

H y = H i (0 + e ja sin a + e 2ja sin 2a) = ijH x 

To interpret this result, let us follow the usual procedure of multiplying the 
expressions by yj 2 e jo,t and taking the real parts which gives 

H x = yjl | H x | % cos ( cot + 9 ) , H y = - yj2 \H X j | sin (cot + 9) 

where \H t \ and 9 are defined through 

H x = \Hi\e i6 
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Drawing a diagram similar to Fig. 2.23, as shown in Fig. 5.8, we see that 
the magnetic field maintains a constant magnitude and rotates clockwise 
with angular velocity (o ; i.e., the field is circularly polarized. Similarly, 
corresponding to x 3 , we have 

= H y = — %jH t 




t 

Fig. 5.8. Explanation for the circular polarization of magnetic field. 



Therefore, the magnetic field for x 3 is circularly polarized at the center and 
rotates counterclockwise with the same angular velocity (o as for x t . These 
results concerning the electric and magnetic fields at the center will be used 
in the discussion of Y circulators in Section 5.8. 

Now, let us restrict ourselves to the symmetrical Y junctions satisfying 
the reciprocal condition discussed in Section 5.2. Since P is equal to either 
I or -I from the symmetry, (5.80) gives S u = S Ji% A comparison with (5.1 13) 
shows that ^ 13 = S 12 . The eigenvalues of S become 



— S 1{ +2 S l2 



2 2 — A 3 — — S 1; 





(5.114) 

(5.115) 
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Let us further assume that the junction is lossless. Then S must be a 
unitary matrix from (5.87); i.e., 

S + S = I 

Multiplying x,(i = 1, 2, 3) from the right and x + from the left, we obtain 

W 2 = l (i = l, 2, 3) (5.116) 

The magnitudes of the eigenvalues are, therefore, all unity. If we set S t t = 0, 
we have X L = 2 S Y 2 from (5. 1 14), and X 2 = X 3 - -S 12 from (5. 1 1 5). However, 

(5.116) cannot be satisfied simultaneously with these eigenvalues. We con- 
clude from this that S 21 cannot be equal to zero for any symmetrical Y 
junction if it is lossless and reciprocal. If all the diagonal components of a 
scattering matrix vanish, i.e., if all the ports are simultaneously matched, 
the junction is said to be totally matched. With this terminology, the above 
conclusion of S n ^0 can be restated as follows: A symmetrical Y junction 
can never be totally matched if it is lossless and reciprocal. We can prove a 
slightly more general theorem: Any lossless reciprocal three-port network 
cannot be totally matched. Assume the contrary; then from the reciprocal 
condition (5.80) we have 

0 PlSl2 Pl^lS 

(PS), = PS = Pl S l2 0 p 2 S 23 

-Pi ^1 3 P2$23 0 . 

which is symmetrical with respect to the main diagonal. From the lossless 
condition (5.87) or equivalently from (PS) + P(PS) = P, we obtain 

Si3S 23 = 0, S I2 S 23 = 0, 5i 2 5 13 = 0 

where the off-diagonal components of the above relation are calculated. 
To satisfy the three equations simultaneously, at least two of the components 
*5*1 2 ’ •S’i 3, and S 23 must be zero. Suppose that S i2 and S i3 are zero; then 
the first diagonal component of (PS) + P(PS) = P cannot be satisfied. 
Similarly, every possible combination of vanishing components leads to 
a contradiction showing that the initail assumption of total matching is 
wrong. This completes the proof. 

Finally, let us ask how small we can make |5 n | 2 of a symmetrical Y 
junction when it is lossless and reciprocal? Eliminating S 12 from (5.114) and 
(5.1 15), we have 

3S, i = Aj + 2X 2 
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Writing X i and X 2 in the forms 

A, = e j ° l , X 2 = e j92 

the above equation gives 

9 |S U | 2 = 5 + 4 cos(0! — d 2 ) 

Since the minimum value for cos(0, — 0 2 ) is —I, the minimum value of 
\S n \ 2 is given by 

|Su| 2 = * 

In other words, one-ninth of the incident power is reflected back even with 
the best possible adjustment. The condition to give the minimum reflection 
is given by 

X t = — X 2 

As discussed previously, X j can be changed by inserting a thin conductor 
along the axis of symmetry without affecting X 2 ~ A 3 , and it is possible in 
most cases to satisfy the above condition. 



5.7 Tensor Permeability of Ferrites 

If e, fi , and a are ordinary scalar quantities, the product order in each 
term on the right-hand side of (5.69) can be changed without altering the 
value. Consequently, the terms on the right-hand side canceled each other, 
and the reciprocal relations were obtained. However, if the permeability is 
expressed by a matrix [ft] of order 3x3 and vectors are represented by 
matrices of order 3x1, then H 2 • [ft] Hj is not necessarily equal to II t • [//] H 2 
since the order of matrix product cannot generally be interchanged. The 
reciprocal relations do not follow, therefore, and the possibility arises that 
useful circuits can now be found which were not possible under the restric- 
tion of the reciprocal conditions, such as (5.80). For this reason, a large 
number of materials were investigated, and a class of magnetic materials 
called ferrites was experimentally found to have desired nonreciprocal char- 
acteristics when a static magnetic field was applied. The permeability of 
ferrites for small microwave fields is generally expressed in the form 



P -jic 0 

M = jK fi o 

.0 0 Ho 



( 5 . 117 ) 



where the z direction is chosen to coincide with the internal static magnetic 
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field H 0 . The values of fx and k in (5.117), and hence those of fi + K and 
fi — K, change with |H 0 |. The typical variations are shown in Fig. 5.9. 

The product of a matrix [ft] and a vector H gives another vector B. 
A matrix which gives a vector in this way, when multiplied by another vector, 
is called a tensor, and ferrites are therefore said to have tensor permeabilities. 
The only information required for a discussion of microwave circuits 



Fig. 5.9. The permeabilities ft + k and n — k as function of static magnetic field. 



containing ferrites is the knowledge concerning [//] expressed in (5.1 17), and 
the variations of fi + k and fi — K with |H 0 | as illustrated in Fig. 5.9. In this 
section we will briefly review the origin fo such a tensor permeability. 

Suppose that there is a small loop carrying a current I. Let A be the area 
inside the loop and k be the unit vector normal to the loop surface, as shown 
in Fig. 5.10. Then the current loop is said to have a magnetic moment given 
by m = IAk. In an atom each electron is orbiting and spinning, thereby 





Fig. 5.10. Current loop. 
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forming two types of current loop. Ordinary substances can therefore be 
considered as a large number of current loops floating in a vacuum. 
Classifying the current loops by their current / and /fk, let us assume that 
there are many different kinds of current loops, and that the density of the 
y'th kind is given by n j per unit volume in the vicinity of r. Summing all the 
the magnetic moments per unit volume, we can define a quantity M called 
the magnetization at r by 

= M (5.118) 

j 

The magnetization is the density of the total magnetic moment. 

Now consider a macroscopic surface S in the space where the current 
loops exist, as shown in Fig. 5.11, and let us calculate the net current 




Fig. 5.11. Macroscopic surface S and its periphery C used in the derivation of i m = 
V x M. 

crossing the surface, which is given by J i m • n dS, where the integration is 
over S; i m is the effective current density due to the current loops and n the 
unit vector normal to S. If C is the periphery of S, then the current loop 
which does not enclose C has no net contribution to the integral since, for 
such a loop, the same current crossing S from one side also crosses S from 
the other side. In order to calculate the net contribution, let us estimate the 
number of loops of the y'th kind which encircle a short length dl of C. 
Although dl is considered to be macroscopically short and almost straight, 
it is very long compared to the size of the current loops. For a current loop 
to encircle dl , a certain point of the loop surface, say the center of gravity, 
has to be located inside a cylinder whose volume is ( Ak)j • dl, where dl is the 
vector representing the axis of the cylinder, as shown in Fig. 5.12. Since the 
density of the current loops is the number of current loops which encircle 
dl is given by (Xk)jtfd\ on the average, each of which contributes Ij to the 
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represents the free current density and does not include the effect of the 
current loops due to bounded electrons in atoms. If we include all the 
current effects on the right-hand side, it should hold in magnetic materials as 
well. Thus, we have 

V x Ho l B = i + i w + ( SDjdt ) = i + V x M + ( dT>/dt ) 
or equivalently 

V x (hq 1 B — M) = i + (dD/dt) (5.121) 

A comparison of (5.120) and (5.121) shows that 

B = /r 0 (H + M) (5.122) 

For a given applied H, B becomes larger in magnetic materials than in 
vacuum due to the effects of bounded electrons in atoms which create M in 
(5.122). 

In ordinary nonmagnetic materials, the effects of bounded electrons in 
atoms cancel each other and give no net contribution to M. In ferrites, 
however, the magnetic moments due to the effective current loops of some 
of the spinning electrons remain uncanceled and give rise to the magnetic 
properties. An electron has its own mass and charge, and since the charge 
is negative, the effective current loop is formed in the opposite direction 
from the spin direction. Consequently, its magnetic moment m and angular 
momentum J have opposite directions as shown in Fig. 5.13. If we write 
the relation between m and J in the form 

m = y J (5.123) 

then y must be negative. Also y is considered to be one of the fundamental 



0m 




Fig. 5.13. Electron angular momentum J and magnetic moment m. 



constants of an electron, and its value has been experimentally determined 
as 

y = — 1.76 x 10 11 [radian -m 2 /weber* sec] 

When a current loop with magnetic moment m is placed in flux density 
B m , the loop receives a torque m x B m in a direction which tends to maximize 
the flux linkage. This is the same kind of torque which causes a magnetic 
needle to point at the north pole. In general, the time derivative of angular 
momentum must be equal to the torque applied. In the case of an equivalent 
current loop due to an electron spinning about its axis in a ferrite, we have 

dJ 

df =mxB " 

Using (5.123), this becomes 

dm 

— = ym x B m (5.124) 

As we can see from (5.122), B m consists of two terms, one due to the internal 
magnetic field H f , and the other due to the magnetization M;. Let us now 
assume that the internal magnetic field is sufficiently strong to saturate the 
ferrite, in which case all the m’s in a localized region will have the same 
direction. Then jiqM, gives no contribution to the right-hand side of (5.124), 
and B m can be replaced by /i 0 H; since m and M, are in the same direction. 
Multiplying by the number of m’s per unit volume, (5.124) becomes 

dMi 

-^- = WoM i xH i (5.125) 

It is worth noting that H f is generally not equal to the external magnetic 
field applied to the ferrite because of a demagnetizing effect. 

Let us now write H f as the sum of the static magnetic field H 0 and the 
microwave field He Jtot , and let us write M, as the sum of M 0 and Me Jojt : 

H, = H 0 + Re {y/2 He Ju>t } , M, = M 0 + Re {^2 Me Ja>t } (5.126) 

Assuming that the directions of H 0 and M 0 coincide, we take the z axis to 
be this direction. Furthermore, let us assume that the magnitudes of H and 
M are both small compared to those of H 0 and M 0 , respectively. Substituting 
(5.126) into (5.125) and neglecting the product of H and M, we have the 
following relations between the alternating components : 

j(oM x = y n 0 [ M y H 0 - AfoJ/J , jo)M y = y/< 0 [M 0 H X - M X H 0 ] , jo)M z = 0 
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Solving these equations for M x , M y , and M z , we have 
Vf _ y 2 Vo 2M 0 H 0 H x - j(oyn 0 M 0 H y 

, . j<#MoM 0 H x + y 2 p 0 2 M 0 H 0 H y 



y 2 p 0 2 H 0 2 -co 2 



(5.127) 



M, « 0 



The alternating component of magnetic induction B is given by fi 0 (H +M); 
B can also be expressed as [p] H, using the tensor permeability [/i] in (5.1 17); 
therefore, we have 

' p — jK O' 

B = p 0 (H + M) = jK p OH 

Lo 0 p 0 _ 

A comparison of this equation with (5.127) shows that 

y 2 Ho 3 M 0 H 0 \ co 0 co M j 



y Ho M 0 ti 0 co 0 co M 

V Vo "b 2 2 rj 2 2 — Vo ] 1 + 2 : 

7 Vo Hq — w ( (o 0 — (o 



(oyp 0 2 M Q 
y 2 n 0 2 H 0 2 - <o 2 



( 0(0 M 

Vo 2 2 

(Oq — (O 



(5.128) 



where 



®o = - VVo^o , o>m = ~ VVoK 



The term to 0 is the natural precession frequency of the electrons and is 
proportional to the magnitude of the static magnetic field H 0 . The term 
(o M is proportional to M 0 , the saturation magnetization which is a property 
of the ferrite under consideration. From (5.128), we obtain 



V + k = Vo -U + 



(o 0 + (O 



V~k = H 0 < 1 + 



(5.129) 



Since co 0 is proportional to H 0 , (5.129) gives a good explanation of why 
fi + K and /i — » c vary with H 0 as shown in Fig. 5.9, excluding the region 
where H 0 is small, and the assumption of ferrite saturation is not valid. 

When H is perpendicular to the z axis and circularly polarized, it is 
proportional to either 

rn r i i 
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depending on the direction of rotation as we discussed in Section 5.6. The 
magnetic induction B is given by (p + k) H or (p. — k) H if H is proportional 
to the first or second vector, respectively. In other words, the effective perme- 
ability of the ferrite for the circularly polarized field becomes // + k or 
p — K depending on the direction of rotation. 

Both p and k become infinite in (5.128) as g> approaches co 0 ; however, in 
practice, various effects neglected in the above discussion, such as spin-spin 
and spin-lattice interactions, introduce losses, and consequently p and k 
remain finite. This situation is similar to that of a cavity near the resonant 
frequency ; no matter how small the losses are, the fields are kept from growing 
infinite. In addition to the above-mentioned losses, introduced at or near 
(o Q , when the applied magnetic field is below saturation, ferromagnetic 
domains are formed, and resonances due to the natural internal magnetic 
fields in the domains may also absorb microwave energy. Furthermore, the 
domains tend to move under the influence of low frequency microwave fields, 
thereby introducing another loss-mechanism in the ferrite. In order to 
minimize the effect of these “low-field” losses, H 0 must be made sufficiently 
large to saturate the ferrite. 

5.8 Three-port Circulators 

In Section 5.6, two of the eigenvalues of S, X 2 and 2 3 , were found to be 
equal to each other for a symmetrical Y junction when it was reciprocal. 
Suppose, however, that a piece of ferrite is introduced at the center of the Y 
junction without destroying the symmetry, then A 2 and A 3 may no longer be 
the same since the restricting reciprocal condition has been removed. The 
eigenvectors, on the other hand, remain the same because of the symmetry, 
and they are given by x l5 x 2 , and x 3 defined in Section 5.6. 

Let us assume that X u X 2 , and X 3 are somehow adjusted to have the same 
magnitude but different phases 120° apart from each other; i.e., 

X 2 = X i e ja , X 3 = X t e /2a 

where a is given by 2it/3, as before. Under this assumption, if a unit wave is 
incident on port I; i.e., if 

T 

a = 0 = ( Xl + x 2 + x 3 )/ x /3 
. 0 . 
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is incident on the junction, then the reflected wave b is given by 
b = Sa = (SXi + Sx 2 -f S x 3 )/ > /3 

= + e j2a x*)lsl^ 

'O' 

= i, 0 

. 1 . 

where use is made of the fact that the eigenvectors are multiplied by their 
eigenvalues when S is applied. The above result indicates that X { emerges 
from port III, and nothing emerges from port II when a unit wave is incident 
on port I. Similarly, when a unit wave is incident on port II or port III, X t 
emerges from port I or port II, respectively. The junction is called a circulator 
when there is a rotational relation between the incident and reflected waves 
as illustrated in this example. If the order of leading phases is changed from 
X\,X 2 , and X 3 to X 1 , X 3 , and X 2 , an incident wave on ports I, II, and III 
emerges from ports II, III, and I, respectively, and a circulator with rotational 
direction opposite to the previous case will be obtained. 

With this much preparation, let us now consider how the phases of the 
eigenvalues can be adjusted to 120° apart. Let v* and i, be the voltage and 
current at the junction ports when x ; is incident. Furthermore, let the 
superscript (0) indicate the case with no static magnetic field applied to the 
ferrite. Noting that b = Sx f = Jl,x ( , we have from (5.100) and (5.101) 

(*!”’), i; - W, i, !0) = (G + x,- + GilM 2F,P,2FP(x J . - AjXj) 

- (G + Xj + GXjXj) t 2F,P,2FP( Xi - 2 !°>x,) 

Since F, G, and P are diagonal matrices, transposing them introduces no 
change, and the order of their product can be interchanged. Furthermore, 
since each term on the right-hand side is a matrix of order 1 x 1, we can 
take the transposed matrix of the second term without changing the result. 
Thus, a little manipulation shows that 

( v i Or — ( y j)t = ( x f)» (^f 0) ~ Xj) 4b 2 (G + G + ) X; 



Using the relation 4F 2 (G + G + ) = 2P, this reduces to 

OTX ~ (vjX il 0) = 2(Aj°> - Xj) ( Xi ), Px, 



(5.130) 



The left-hand side can be rewritten in terms of the electric and magnetic 
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fields at the port reference planes : 

(vi 01 ), i, - (Vj), ii 0) = I J {E<°> X H, - E, x H<°>} -k dS 

where the summation is over all the ports. Let n be the outward unit vector 
normal to a closed surface S enclosing the junction ; then n = — k over the 
reference planes, as illustrated in Fig. 5.1. Therefore, the right-hand side 
becomes 

- Z J {E! 0) x — Ej x Hi 0) } • n dS = - J V • {E| 0) x H, - E,- x H t ‘ 0) } dv 

where use is made of Gauss’s theorem. Following the method employed in 
(5.69), the volume integral can now be calculated 

J V • {E! 0) x Hj - Ej x HS 0) } dv = | j(oH\ 0) -([/i] - n (0) ) H , dv 

As a result, (5.130) becomes 

(A,-Ai°>)(x j ),Px j = a< 0 /2)| H! 0) • ([/,] - n m ) Kjdv (5.131) 

Setting i = j = 1, the change of X t , due to the application of a static magnetic 
field on the ferrite, is calculated to be 

- 4 0> = ± i. i°> J H< 0) •([//]- ^ (01 ) H, dv 

where the upper and lower signs correspond to cases where P is equal to 
I and - I, respectively. Similarly, setting either / = 3, j = 2 or i = 2, j = 3, 
we have 

*2 - 4 0) - ± ijo> J h<3 0) • (M - /I 10 ’) h 2 dv 
A, - A< 3 °> = ± ijm J H‘°> • (M - ^ (0 >) H 3 dv 

where use is made of the relation 2 ( 2 0) = A ( 3 0) . 

Now suppose that the static magnetic field is gradually increased in the 
axial direction of the Y junction. The initial contribution to the integral 
in the first equation from the axial component of Hj is negligibly small, 
since p 0 = // 0) . Furthermore, the component perpendicular to the axis is zero 
at the center and small in the vicinity of the center; hence, will 
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remain small. On the other hand, H 2 and H 3 have perpendicular components, 
and both \X 2 -2. ( 2 0) \ and |/l 3 -A ( 3 0) | grow rapidly with increasing static 
magnetic field. The magnetic fields H 2 and H 3 are respectively proportional 
to 




along the center axis; consequently, we have 

' H —jK 0 1" 

H 2 ~H« 0) H 2 [l -i 0] jk !X 0 j 

. 0 0 n Q _o_ 

= Hi 0) H 2 2{fi + k) 

where H 3 0) and H 2 are complex quantities representing the x components 
of H ( 3 0) and H 2 , respectively. Similarly, we have 

HiP-M H 3 - H^H 3 2{p - k) 

From these relations, the changes of X 2 and X 3 due to the static magnetic 
field are given by 

X 2 - 4°> ~ ± j(o {(/i + k) - n w } | H ( 3 0) H 2 dv 

X 3 - A ( 3 0) a ± jco {(fi -k)~ /i (0) } J Hi 0) H 3 dv 

When the static magnetic field H 0 is small, p. and k vary with H 0 , as shown 
in Fig. 5.14, and p remains approximately equal to p (0) . Furthermore, 
f/ 2 0) is equal to H^ 0) and, both H 2 and H 3 are expected to be initially close to 
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this value which corresponds to zero static magnetic field. Therefore, the 
above equations show that X 2 — A ( 2 0) and X 3 — A 3 0) must change proportion- 
ally to k and — k, respectively. That is, with increasing H 0 , X 2 and X 3 move in 
opposite directions starting from the original value X ( 2 } = A 3 0) . If we neglect 
the losses in the junction, the magnitudes of the eigenvalues become unity, 
and they should appear as illustrated in Fig. 5.15(a). If we further increase 




Fig. 5.15. Variations of the eigenvalues Xi, Xz, and A 3 with static magnetic field. 

H 0 , then H 2 and H 3 may no longer be close to their original value, and hence 
X 2 and X 3 cease to be symmetrical with respect to the original eigenvalue 
X 2 0) = A^ 0) . At some value of H 0 , the angle between X 2 and X 3 will become 
120°, and X { may have shifted slightly from X^ since Hj is not exactly equal 
to zero over the ferrite. As a result, X 1 ,X 2 , and X 3 will have taken the positions 
in Fig. 5.15(b). If we now insert a thin conductor along the center axis and 
adjust X t , without changing A 2 and X 3 as explained in Section 5.6, until the 
angles from X t to X 2 and X 3 become 120°, then the junction becomes a 
circulator. 

In practice, we carry out the reverse process to minimize the insertion loss. 
First, we choose the dimension of the ferrite in such a way that it resonates 
at the desired frequency as a transmission type cavity (i.e., no power trans- 
mission takes place at detuned frequencies) with no static magnetic field 
applied. To observe the resonance clearly, we may have to decrease the 
coupling between the waveguides and the central part of the junction where 
the ferrite is located. Once the dimensions are properly chosen, we apply the 
static magnetic field and observe the split of the resonance curve due to the 
removal of the degeneracy. If a wrong resonant mode was picked up, the 
split would not occur. As we increase the static magnetic field, the separation 
between the two resonances becomes wider, and at a certain point, the lower 



252 



5. MATRICES AND WAVEGUIDE JUNCTIONS 



resonant frequency becomes stationary and then starts increasing. This 
corresponds to the point where n + k starts decreasing in Fig. 5.9 and 
indicates that the ferrite is saturated. We fix the static magnetic field at this 
value to obtain the minimum low field losses without suffering excessive 
losses due to the ferromagnetic resonance discussed at the end of Section 
5.7. We then adjust the coupling between the waveguides and the central 
part of the junction to obtain proper circulator action. The reason for the 
success of this procedure can be seen as follows. At detuned frequencies, the 
eigenvalues X u X 2 , and X 3 are identical if the original resonance is a trans- 
mission type. As we increase the frequency and pass through the resonances, 
X 2 first rotates 360°, since H 2 sees /j + k, and then X 3 rotates 360°, while X t 
more or less stands still. By changing the couplings, we change the text’s, 
and hence the speed with which X 2 and X 3 rotate. When the speed is properly 
adjusted, X u X 2 , and X 3 will be positioned 120° apart from each other at 
(or at least near) the desired frequency, and the junction will become a 
circulator. 

Let Z 0 be the reference impedance for the symmetrical circulator, then 
the impedance looking into the circulator from each port must be Z 0 *. 
If we connect a lossless two-port network to port I in order to transform 
Z 0 * to Z t *, and consider the other port of the two-port network as a new 
port of the junction with reference impedance Z u then the junction acts as a 
circulator with the reference impedances Z l5 Z 0 , and Z 0 at ports I, II, and 
III, respectively. Similarly, by adding a lossless two-port network which 
transforms Z 0 * to Z 2 * at port II and another which transforms Z 0 * to Z 3 * 
at port III, we obtain a circulator with reference impedances Z u Z 2 , and, 
Z 3 ; these impedances are arbitrary as long as their real parts have the same 
sign as that of Z 0 . 

The general form of a lossless circulation is given by either 

‘ 0 e je ' 0 [00 e J ° 1 ' 

S = 0 0 e* 2 or S= e j02 0 0 

.e m 0 0 [o e* 0. 

depending on the direction of the circulation, where 0 r , 0 2 , and 0 3 are 
arbitrary real angles. Now suppose that the scattering matrix is given by the 
first form, and that an impedance Z, instead of Z 3 , is connected to port III, 
and a unit power wave is incident on port I. Since the relation between the 
voltage and current at port III is given by V 3 = — ZJ 3 , a 3 is given by 



a 3 = {(Z-Z 3 )/(Z + Z 3 *)} b 3 



(5.132) 
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where use is made of (5.74) with i = 3. Substituting this into the relation 
b = Sa, and noting that a t = 1 and a 2 = 0, we have 

b 2 = e J<,J+,5 >{(Z - Z 3 )/(Z + Z 3 *)} 

Assuming that the real parts of the Z- s are all positive, the output power 
at port II therefore becomes |(Z- Z 3 )/(Z + Z 3 *)| 2 times the incident power 
on port I. If ReZ is negative, |(Z — Z 3 )/(Z + Z 3 *)[ > 1 and power amplifi- 
cation is obtained. 

PROBLEMS 

5.1 Following the discussion on self-adjoint matrices, obtain the spectral representation 
df a unitary matrix. Show also that if two unitary matrices commute, they can be 
simultaneously diagonalized by a unitary matrix through the similarity transfor- 
mation. 

5.2 Suppose that three rectangular waveguides are connected to form a four-port 
junction, as shown in Fig. 5.16(a), and each has only one propagating mode TEio, 





% 



Fig. 5.16. (a) Magic T t (b) Intuitive explanation of the phase relation between ports 
I and II. 
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then an incident wave on port III will be divided in two and will emerge from ports I 
and II with equal, but out-of-phase, amplitudes, as shown in Fig. 5.16(b). Port IV 
has no output since the wave into port III tries to excite a TEoi mode in the port IV 
waveguide which is in the cutoff region. Similarly, an incident wave on port IV will 
emerge from ports I and II in-phase but nothing emerges from port III. This wave- 
guide junction is called a magic T. Assuming total matching, calculate the scattering 
matrix of a lossless magic T. 

5.3 Repeat the discussion given in Section 5.6 for symmetrical cross junctions. 

5.4 Calculate the internal static magnetic field Ho that is necessary for the natural 
precession frequency of electrons to be 4000 MHz. 

5.5 Suppose that a thin ferrite rod is inserted along the center axis of a circular waveguide 
and a static magnetic field is applied in the axial direction. Then the plane of polar- 
ization of the TEn mode will rotate with distance along the axis. This phenomenon 
is known as Faraday rotation. By decomposing the linearly polarized wave into two 
oppositely rotating circularly polarized waves, explain why such a rotation takes 
place. 

5.6 Adjust the length of the above ferrite rod and/or the static magnetic field so as to 
rotate the plan e of polarization 45° and connect transducers from the circular TEn 
to the rectangular TEio mode at both ends, after twisting one 45° with respect to 
the other, as shown in Fig. 5.17. If a resistive film is placed in each transducer to 




absorb the mode corresponding to the rectangular TEoi, the device acts as an 
isolator; the wave passes through the device in one direction with little attenuation, 
whereas it is almost completely absorbed in the other direction. Explain qualitatively 
how the isolator action takes place. 

5.7 Prove that a lossless symmetrical Y junction is a circulator if it is totally matched. 

5.8 If the ferrite dimension for a symmetrical three port circulator is chosen so as to 
reflect the incident power at the resonant frequency, and if a separate resonator is 
properly coupled to xi mode only, then, with a proper magnetic field applied, the 
circulator action takes place over a relatively wide frequency range. This is because 
all three vectors /li, A 2 , and A 3 rotate in the same direction with similar velocities 
maintaining the proper phase relation (120° apart) near the resonance. Explain 
qualitatively how this proper phase relation can be obtained. 




5.9 Prove that, if reference impedances are changed from Zf to Zi (/ = 1, 2, ..., «), the 
scattering matrix is transformed from S to S'; 

S' = A -1 (S - r + ) (I - FSO^A* 

where T and A are the diagonal matrices with ith diagonal components n = (Zi' — Z<)/ 
(Zi' + Zt*) and |1 - /- i ri*| 1 ' 2 (l — n*)/\l — n\, respectively. 
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CHAPTER 6 

COUPLED MODES 

AND PERIODIC STRUCTURES 



When several transmission lines are coupled together through pertur- 
bations of various kinds, we can treat the resultant complicated trans- 
mission system as a waveguide and develop an appropriate new theory for 
it as we did in Chapter 3. Instead of proceeding in this manner, however, it 
is more advantageous in many cases to discuss the complicated system as a 
combination of individual simple transmission lines; in this way we can 
make full use of knowledge concerning individual transmission line properties 
since these are generally well understood from separate studies. In this 
chapter, we shall study one such approach called the theory of coupled 
modes; this provides a powerful tool for the understanding of relatively 
complicated combined systems often encountered in practice. 

The discussion will reduce to an eigenvalue problem of a matrix. However, 
since the matrix is not self-adjoint, a new method is developed to study the 
eigenvalue problem. This method is applicable to many problems, including 
the discussion of periodic structures. 

First, we set up an equation for a system with lossless distributed coupling, 
and then deduce the eigenvalue problem. After discussing the general 
properties of the solutions, we solve the eigenvalue problem using a pertur- 
bation method. We then discuss in detail the interaction of two waves as 
the simplest, important example of coupled modes. We next formulate 
two eigenvalue problems for periodic structures, one for lossless systems, 
and the other for reciprocal systems, and then discuss the properties of their 
solutions following the method developed for distributed coupling. Finally, 
taking the specific example of a waveguide with periodically placed identical 
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discontinuities, we discuss the co-fi diagram and the space harmonics; these 
are found to be particularly useful concepts when a periodic structure is 
designed for a practical application. 

6.1 Distributed Couplings 

Suppose that n propagating waves exist along a transmission system and 
other waves can be neglected, either because they are in the cut-off frequency 
ranges or because they are not excited. Let us indicate the magnitudes and 
phases of the n waves by complex numbers a, (z), a 2 (z),..., a n (z) and let 
a(z) be a vector having the a n {z)\ as its components. We assume that the 
waves are orthogonal to each other and normalized so that /> i |a i (z)| 2 gives 
the transmission power of the zth wave in the positive z-direction, where 
pi is 1 or — 1, and that the total transmission power is equal to the sum of 
individual transmission powers. In other words, the total power is given by 

P = Y, Pi\ a i( z )\ 2 ~ a+ ( z ) Pa (z) (6.1) 

where 

P = diag [p,, p 2 , ..., p„] (6.2) 

and a + (z) is an abbreviation for (a(z)} + . In Chapter 3 the waves in a guide 
were shown to have this property. 

Now, let us introduce some couplings between the n independent waves. 
The wall losses discussed in Section 3.8 certainly provide an example of 
coupling, however, in the present discussion we shall restrict ourselves to 
lossless couplings. An example of lossless coupling is shown in Fig. 6.1 in 
which two waveguides are coupled together through a large number of small 
windows in the common sidewall. If each waveguide has only one propaga- 
ting mode, and if the waves propagating in the negative z-direction can be 
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neglected because of the method of excitation, a x (z) may represent the wave 
propagating in the positive z-direction in one waveguide, and a 2 (z) in the 
other. The original unperturbed transmission lines in this case are two 
waveguides in parallel, each supporting only one propagating mode. 
Strictly speaking, a finite number of waves cannot satisfy the boundary 
conditions which are modified by couplings; however, for minor modifica- 
tions, the effect of other waves on the transmission power may be negligible, 
and hence the total power is still given by (6. 1) to a first order approximation. 
In the following discussion, this is assumed to be the case, and we shall 
study how a(z) changes with z. 

The vector a(z) represents the magnitudes and phases of the waves at z, 
and hence a (z + Az) represents those at z + Az. Since the relation between 
the electromagnetic fields represented by a(z) and a(z + Az) is linear, there 
must be a linear relation between a(z) and a(z + dz). When higher order 
terms of Az are neglected, this linear relation should be expressible in the 

f QTm 

a(z + Az) = a(z) — Ca(z) Az (6.3) 

since a(z + Az) should coincide with a(z) in the limit of Az-> 0. The square 
matrix C thus introduced is called the coupling matrix. The negative sign in 
front of C is conveniently chosen so as to enable us to reach an eigenvalue 
problem in a conventional form. Transposing a(z) to the left-hand side, 
and taking the limit of dz-+ 0 after dividing both sides by Az, we obtain 
a matrix differential equation 

= — Ca(z) (6.4) 

dz 

The transmission power in the positive z-direction is given by (6.1), and 
this must be independent of z when the system is lossless. Therefore, we have 

^-{a + (z)Pa(z)} = 0 (6.5) 



Using the standard formula for differentiating a product of functions, we 
can write (6.5) as follows. 

da 1 (z) , da(z) 

-^Pa(z) + a + (z)P — = 0 

Substituting (6.4) into this expression gives 

— a + (z) (C + P + PC) a(z) = 0 



( 6 . 6 ) 
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Since this equation holds regardless of the value of a(z), we obtain 

C + P + PC = 0 (6.7) 



This is the condition which the lossless coupling matrix C has to satisfy. 
From (6.7), the diagonal components of C must satisfy 

Q = - CJ! (6.8) 

in other words, the diagonal components of C are all pure imaginary. To 
evaluate the magnitude of C u , let us assume that the magnitudes of all the 
waves except a,(z) are zero at z, then from (6.4) we have 



<M Z ) 

dz 



= ~ C,j<ij(z) 



Suppose that a^z) changes with z as exp(— jf} t z) before the couplings are 
introduced; the introduction of small couplings represented by C should, 
in general, cause a f (z) to behave somewhat differently. However, when no 
other waves exist at z, as we have assumed, no effects are expected through the 
couplings, and hence a t {z) is expected to change as exp (—jpjZ). A sub- 
stitution of this functional form into the above equation shows that C u 
must be approximated by jfl t which is pure imaginary as required form (6.8). 

From (6.7) the off-diagonal components of C have to satisfy 

Cu = " C*i ( Pi - Pj) (6-9) 

Cu = C* ( Pi = - Pj ) (6.10) 

The first condition applies when Pi and Pj have the same sign while the 
second one applies when the signs are opposite. Note that these conditions 
impose a restriction on the relation between C u and C y , but no direct restric- 
tions on the value of C y itself. 

In order to derive an appropriate eigenvalue problem from (6.4), let us 
assume that a(z) changes exponentially with z, i.e., a(z) = xe~ yz , as we did 
in the discussion of waveguides. Substituting this expression into (6.4), we 
obtain the desired equation 

(C-yI)x = 0 (6.11) 

It is expected that any solution of (6.4) may be expressible as a linear 
combination of the solutions of the eigenvalue problem (6.1 1); the behavior 
of these solutions is simple and well understood, and hence a systematic 
eigenfunction approach to the coupling problem should become possible. 
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For this reason, we shall proceed with a study of the above eigenvalue prob- 
lem and establish the basis for the eigenfunction approach. 

First, we note that the discussion of the eigenvalue problem of self- 
adjoint matrices given in Section 5.1 is not applicable to the present problem, 
since C + is not equal to C. Using the relation (6.7), however, we can derive 
several important theorems. Equation (6.11) represents n simultaneous 
linear equations for the components of x. To ensure nontrivial solutions, 
the determinant of the coefficients must equal zero which leads to an alge- 
braic equation of the «th degree for y whose roots give the eigenvalues, 

det (C — yl) = 0 (6.12) 

Let us consider the case in which all the roost are distinct, then the 
following four theorems I-IV can be derived. The proof will be given after jj 

each statement. j: 

I. Any w-dimensional vector can be expressed as a linear combination of .-I 

the eigenvectors. 

Proof. Since each root of (6.12) gives an eigenvector, we obtain n different f 

eigenvectors. Suppose that the eigenvectors x l5 x 2 > ..., x^ are independent I 

(i.e., i 1 a i x i = 0 cannot be satisfied unless the #,’s are all zero), but that 
x p ( p ^ n ) is not, then we have 

p-i 

x P = £ «* x * (6.13) 

* =1 H 

t 

Applying C from the left, we obtain : 

p - 1 

y p x p = £ a,y*x, (6.14) 

k = i 

Substitution of (6.13) into (6.14) gives 

E (V* - Ip) «*** = 0 (6-15) | 

it = i §| 

By hypothesis, y k # y p and not all of the a,’s are equal to zero, (6.15) W 

therefore shows that x l5 x 2 , ..., x p _j are not independent of each other which 
leads to a contradiction. Thus, x p is proved to be independent of x t , x 2 , . . . , x p _ t 
provided that these p — 1 eigenvectors are independent of each other. Pro- f 

ceeding from p = 2 and increasing p in unit steps, we see that all the eigen- | 

vectors are independent of each other. Since an arbitrary vector can be si 

expressed as a linear combination of n independent vectors in an «-dimen- f j 

"> 'i 

£ i 

i 
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sional space, and since the eigenvectors form a set of n independent vectors, 
any vector can be expressed as a linear combination of the eigenvectors. 

II. If y k is an eigenvalue, - y k * also becomes an eigenvalue. 

Proof. Since y k is an eigenvalue, we have 

det (C- y,I) = 0 

Taking the complex conjugate transposed determinant, we obtain 

det(C + — 7 ,*I) = 0 

Multiplying det P _1 and det P from the left and from the right, respectively, 
the above equation becomes 

det(P _ 1 C + P — y fe *I) = 0 

Replacing C + P by -PC, which are equal to each other from (6.7), we 
obtain 

det(C -t- 7 ,*I) = 0 

which shows that — 7 ,* is an eigenvalue of C. When 7 , is pure imaginary, 
the theorem is trivial since 7 , is equal to — 7 ,*, however, when 7 , has a 
real part, the theorem asserts that an eigenvalue having the opposite sign in 
front of the real part also exists. In other words, whenever a mode exists 
which grows with distance, a corresponding decaying mode exists having 
the same phase constant. 

III. Let 7 , and 7 , be the eigenvalues corresponding to eigenvectors x k and 
Xj, respectively. If 7 , ^ - y*, there is an orthogonal relation between x, and 
x, of the form 

x, + Px t = 0 (6.16) 

Proof. By definition, we have 

Cx * = , Cx ; = 7 /Xj 

Multiplying the first equation by x, + P from the left, and the adjoint of the 
second equation by Px, from the right, and then adding the two equations, 
we have 

x/PCx, + x, + C + Px, = ( 7 , + 7 /*) x, + Px, 

The left-hand side can be rewritten in the form x, + (PC + C + P) x,, which 
is equal to zero from (6.7). Therefore, the right-hand side must also vanish 
which means that (6.16) holds, since 7 , 4 - 7 /* is not equal to zero by hy- 
pothesis. 
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IV. If y k = — y*, we have 

X/ + Px£ # 0 (6.17) 

Proof. By theorem I, Px, can be expressed as a linear combination of the 
eigenvectors, i.e., 

Px / = X 

m 

where the summation is extended from m = 1 to n. Multiplying the above 
equation by (Px,) + = x, + P from the left, and since x, is not equal to zero, 
we have 

0*(Px,) + (Px,) = X V ,-Px m 




I 



However, all the terms on the right-hand side, except the &th term for which 
y k — — yi*, are equal to zero. Therefore, we obtain 

a*x, + Px* ^ 0 

from which (6.17) follows immediately. 

When y* is pure imaginary, let us define \ k to be 

x* = cx k + (6.18) 

On the other hand, when y k has a nonzero real part 

\ k = cxf (619) 

where \ ( is the eigenvector corresponding to y t = — y k * whose existence is 
guaranteed by Theorem II. It follows from Theorem IV that x fc Px A is not 
equal to zero provided the constant c is not equal to zero. Let us choose the 
value of c in such a way that x*Px fc becomes unity, then we have the 
orthogonality and normalization conditions 

x,Px k = 0 (l^k) (6.20) 

x*Px* = 1 (6.21) 

The normalization condition does not uniquely determine x k ; if one wants 
to determine it uniquely, another condition must be imposed such as 
x*j + Xfc = 1, in our present discussion, however, this is not necessary. 

The above results are derived under the assumption that all the n eigen- 
values are distinct, however, (6.12) may have one or more multiple roots. 
In such a case, the number of distinct eigenvalues is less than n, and the above 
technique of proving Theorem I fails. Fortunately, we can still obtain n 
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independent eigenvectors, as we shall discuss next, and hence the theorem 
itself holds without modification. 

Let C' be a coupling matrix satisfying (6.7), and let us consider C + eC', 
where C is the coupling matrix under consideration, and g is a small positive 
number. The n eigenvalues of C + aC' can be made distinct from one another 
by choosing C' and e properly; the eigenvectors x fc (e) of C + aC' and the 
corresponding x k (e)’s will then satisfy (6.20) and (6.21). The eigenvalue 
y k (e), corresponding to x*(g), is a root of 

det(C + aC' -yl) = 0 

The coefficients of this algebraic equation vary continuously with g, hence 
the root also changes continuously with e since it is a continuous function 
of the coefficients. In other words, y*(g) is a continuous function g. 

The components of x*(g) are given by the solutions of n simultaneous 
equations 

(C + cC' - y* (g) I) x = 0 

Since y fc (g) is assumed to be a simple root, the ratios of the components of 
x fe (g) are given by 

— D, : - D 2 :D 

where D is the determinant of the coefficients of the above simultaneous 
equations with its /7th row and /?th column removed, and D x is this modified 
determinant except that the /th column is made up of the last coefficient in 
each equation. Therefore, the ratios also vary continuously with g. 

If we define y fc , x k , and x k by 

y k = lim y k (g) , x* = lim x k (g) , x k = lim x k (g) 

£—♦0 £-*0 £-*0 

respectively, then y k , the component ratios of x* and those of x fc are uniquely 
determined since they are all continuous functions of a. Furthermore, y k 
and x k respectively become an eigenvalue of (6.11) and the corresponding 
eigenvector. Since the x*(g)’s and the x*(a)’s satisfy (6.20) and (6.21) regard- 
less of the value of e, the x k 's and the x*’s thus defined also satisfy these two 
equations. The x*’s are independent of each other from the proof of theorem 
I provided the corresponding eigenvalues are distinct. However, for the 
degenerate eigenvectors, we have to modify the discussion as follows. 
Suppose that they are not independent, then we have 

I a i x i = 0 




264 



6. COUPLED MODES AND PERIODIC STRUCTURES 



where the x,’s are degenerate eigenvectors and the a,’s are not all equal to 
zero. Assuming a k is not equal to zero, and multiplying by x k P from the 
left, we have 

a*x k Px k = 0 

which contradicts (6.21). Thus, we see that the n eigenvectors are independent 
of each other, and Theorems I-IV hold without modification, even in 
degenerate cases. 

It may be worth noting that different sets of eigenvectors will be obtained 
depending on the choice of C' in the above procedure. However, each set 
satisfies (6.20) and (6.21), and there is no preference for any particular set, 
unless we wish to introduce another perturbation in the system. 

An arbitrary vector can be expressed as a linear combination of the n 
independent eigenvectors: 

x = £ a k x k 

If we multiply this equation by x k P from the left, all the terms on the right- 
hand side, except the /rth one, disappear because of (6.20) and (6.21), leaving 

«* = x k Px 

A substitution into the original expression for x gives 

x = £x k (x k Px) (6.22) 

Multiplying by C from the left and using Cx k = y k x k , we have 

Cx = X y k x k (x k Px) (6.23) 

Since (6.23) holds regardless of the value of x, C can be written in the form 

C- = Ew,P. (6.24) 

which is called the spectral representation of C, where the dot signifies that 
C is an operator. 

6.2 Perturbation Method for the Eigenvalue Problem 

To obtain the solutions of (6.1 1), we have to solve an algebraic equation 
of «th degree. In principle, this is always possible, but, in practice, it may 
prove to be a formidable problem. Fortunately, when the couplings are 
small, good approximate solutions can be obtained by a perturbation method. 
In this section, we shall study this method in detail, assuming that the 
couplings are small. Two distinct types of wave interaction will be shown to 
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exist depending on the signs of the transmission powers of interacting waves. 

Let C 0 be a diagonal matrix having diagonal components identical to 
those of C itself, and let C k be C — C 0 , then the diagonal components of 
C t are all zero, and its off-diagonal components are small. We write C 0 and 
C t as follows: 



0 0 



0 C i2 

C 2 1 0 



xi°> = 



v (0) _ 
X 2 - 



xi 0) = 



The corresponding X; 0), s are 

x ( i 0) = [Pi 0 - 0], x ( 2 °> = [0 p 2 



(6.25) 



By inspection, the eigenvalues of C 0 are found to be C n , C 22 ,..., C„„, and 
the eigenvectors are given by 



(6.26) 



(6.27) 



Since C L is small, there may be an eigenvector of C, namely \ h which is 
similar to x- 0) . Let us write x f in the form 



‘■• = xS°> + Xrf 



(6.28) 



The eigenvalue y t is expected to be close to >>j 0) = Q-, hence we write y t 
in the form 

y, = yl 0> + Ay> ( 6 . 29 ) 

Substituting (6.28) and (6.29) into the left-hand side of (6.1 1), we obtain 

(C - y.I) X,. = (C, - Ay) x!°> + X - yj°>) x™ (6.30) 

nti 

where all the product terms between C l5 Ay h and c n are neglected since they 
are all assumed to be small. Since X; is an eigenvector of C with eigenvalue 
the left-hand side is equal to zero. Multiplying (6.30) by xl 0) P from the 
left and using the orthonormal property of the xj, 0) ’s, we have 



x! 0) PC 1 x! 0) - Ay t 



(6.31) 
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The left-hand side is equal to zero because the diagonal components of C, 
are all zero. To a first order approximation, therefore, y { is equal to the 
original eigenvalue y{ 0) . Next, multiplying (6.30) by xJ, 0) P from he left 
and using the orthonormal property, we obtain 

£(0)pr~, (0) 

(”*') («2) 

in / i 

Substituting this into (6.28) gives the first order approximation of x f . 

If one of the yj, 0) ’s, say y J 0) (j ^ /), is equal to or close to yj 0 *, the right- 
hand side of (6.32) for n=j becomes infinite or large, and the assumption 
that the c„’s are small is no longer valid. In such a case, we proceed as in 
the discussion of the waveguide wall losses, and write x f in the form 

x, = /lxP> + Bx<°» + £ c„xi 0> (6.33) 

n £ i, j 

where A, B, and the c„’s are constants to be determined. Let 2 Ay 0 be 
v! 0> ~ y anc ^ 2 fo be fi 0) + yf\ th e n 

y! 0) = y 0 + Ay 0 , yf } = y 0 - Ay 0 (6.34) 

Assuming that y 4 is close to y 0 , we write it in the form 

7i = Vo + (6.35) 

and determine Ay t . To do so, we first substitute (6.33), (6.34), and (6.35) 

into the left-hand side of (6.1 1) and obtain 

(C - y,I) X, = C, {^xl 01 + £x<°>} - (Ay, - Ay 0 ) 

-(A y, + Ay 0 )B^P+ £ c„ (y<°> - y 0 ) ^ (6.36) 

n*i,j 

where the products of small terms are again neglected. The left-hand side 
is equal to zero. Multiplying (6.36) from the left by xj 0) P and by x* 0) P in 
turn, we obtain 

- A{Ay { - Ay 0 ) + Bx\ 0) PC lX J 0) = 0 (6.37) 

>4lJ 0, PC 1 x l <0) - B(Ay t + Ay 0 ) = 0 (6.38) 

where the orthonorma! property between the xj, 0) ’s as well as xj 0, PC 1 Xj 0) = 0 
and x^ 0) PC 1 Xy 0) = 0 is used. If the simultaneous equations (6.37) and (6.38) 
for A and B are to have nontrivial solutions, the determinant of the coeffi- 
cients must vanish. This condition gives 

Ay? = x. 0) PCjXy 0) • Xy 0) PC 1 x- 0) + Ay 0 z 
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and x' 0) PC 1 Xj 0> is calculated to be C u and x' 0) PC 1 x- 0) is equal to C jh 
hence the first term on the right-hand side becomes which is equal 

to — \C (j \ 2 or \Cij\ 2 from (6.9) and (6.10) depending on the sign of PfPj. 
Since Ay 0 is always pure imaginary, let us write it as j Ap 0 where Ap 0 is 
real, then the second term on the right-hand side becomes — (d/? 0 ) 2 . Thus, 
we have 

Ay? = - IQ/ - Wo ) 2 (p, = Pj) (6.39) 



Ay? = \C u \ 2 -(Afi o y 



(i Pi = - Pj) 



(6.40) 



When pi = pj, Ay t , and hence y t become pure imaginary. On the other hand, 
when Pi = - pj and |C/ > (d/? 0 ) 2 , Ay { becomes real, and hence y t becomes 
complex. Since (6.40) only determines the square of Ay i} two different y f ’s 
with opposite signs in front of the real part are obtained, as Theorem II in 
Section 6.1 indicated. Figures 6.2(a) and (b) illustrate the relation between 
y ( = a+ jP and Afi 0 for the cases in which p- t = pj and />, = — pj, respectively, 
where a and ft are the real and imaginary parts of y f . The term /? 0 is the 
mean value while Afi 0 is the difference of the phase constants of the two 
original unperturbed waves. 

The ratio of A to B is calculated from (6.37) to be 



A _ x! 0) PC 1 xJ 0) 
B Ay-, - Ay 0 



(6.41) 



Corresponding to two different values of A y h two different ratios are obtained. 
The value of c n is easily determined by multiplying (6.36) by x* 0) P from the 



left: 



+ Bx$ 0) ] 



y! 0) - y 0 




When the yj, 0l ’s ( n # i,j) are not close to y 0 , the c„’s are all small and can 
be neglected since they only introduce second order variations to the trans- 
mission powers. On the other hand, Ay t cannot be neglected, even though its 
magnitude is small since its effect appears as the exponential of Ay t z 
which becomes large as z increases. The wave amplitude corresponding to 
real and negative Ay t grows with z. This suggest that there is a possibility 
of obtaining power amplification when a small coupling is introduced between 
two waves propagating in the same direction with nearly equal propagation 
constants but with positive and negative transmission powers. In ordinary 




1 
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passive networks, no such amplification takes place since waves do not 
carry negative transmission powers. An electron beam, however, can support 
waves with negative transmission powers, and amplification does take place 
when one such wave is coupled to a wave propagating along a nearby 
passive circuit with a similar propagation constant as we shall discuss in 
Chapter 8. 

From (6.31) and (6.32), we saw that waves with sufficiently different 
propagation constants do not interact with each other under the small 
coupling condition. This is understandable since the phase relation between 
waves changes rapidly with distance, and if there is some kind of interaction 
at one point, just the opposite kind of interaction takes place at another 
point thus causing the effects to cancel each other. On the other hand, if 
the propagation constants are similar, the effects of interaction accumulate 
with distance to such an extent that the originaly independent waves can no 
longer be considered as independent, and a pair of new waves with different 
propagation constants result. 

In the above discussion, only two waves were assumed to have similar 
propagation constants, however, a similar method can be applied to cases 
in which the propagation constants of three or more waves are similar. In 
such cases, algebraic equations of correspondingly higher degree have to 
be solved. An example of three-wave interaction will be given in Chapter 8 
in connection with coupling between an electron beam and a transmission 
circuit. 



6.3 Interactions between Two Waves 

As a simple, but important, example of distributed coupling, let us con- 
sider the interactions between two waves propagating in the positive 
2 -direction. When there are several coupled waves, only waves with similar 
propagation constants interact, and the remainder can be considered to 
propagate independently, as we learned in the previous section. Since there 
are a number of cases in which only two waves have similar propagation 
constants, our present study has definite practical value besides having the 
advantage of mathematical simplicity. 

Let a x (z) represent one wave at z and a z (z) the other, and let /? 0 + Afi 0 
and /? 0 - Afi 0 be their respective propagation constants. First, we shall 
consider the case in which the total transmission power in the positive 
z-di rection is given by Mz)| 2 + \a 2 {z)\\ In this case, Pl =p 2 = 1 and the 
coupling coefficients C 12 and C 2I satisfy the relation C 12 = -C 2 * t from 
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(6.9). Thus we have P = diag[l 1], and 

r _[ m + M>) o ] 

° L 0 j(p 0 - Ap 0 )\ ’ 
The eigenvectors of C 0 are given by 





r — 


0 


c 12 


> 




— c* 
L L 12 


0 . 



and the corresponding x ( , 0) and x 2 0) are given by 

x ( 1 0) = [1 0], x ( 2 O) = [0 1] 

Let us write Ay^ ±j Ap, then we have from (6.39) 
*P = {\C l2 \ 2 + (Ap 0 ) 2 }' 12 



(6.43) 



The eigenvalues of C 0 + are approximately given by j(p 0 + A P) and 
j(p 0 — AP), and the corresponding eigenvectors are calculated to be 

= UW - * J?o)/C J ’ * 2= -j(Ap + d/! 0 )/C J 



where use is made of (6.41). The corresponding x t and x 2 are 

- _r IC12I 2 -jC l2 (Ap-Ap 0 ) 

Xl LlC 12 | 2 + (AP - APo) 2 |C 12 | 2 +(Ap - Ap 0 ) : 

5 _[ |C 12 | 2 jC 12 (Ap + Ap 0 ) 

XZ l\C i2 \ 2 + (Ap + Ap 0 ) 2 |C 12 | 2 + (AP + AP 0 ) : 



Since we have obtained all the necessary quantities in their explicit forms, 
let us next study how a v (z) and a 2 (z) change with z, assuming that a x (z) = A 0 
and a 2 (z ) = 0 at z = 0. The condition at z = 0 can be expressed by 

a (°)=K°] 

In terms of the eigenvectors x x and x 2 , a(0) becomes 
a(0) = x x P j^°J x x + x 2 P x 2 



— A 0 |C 12 | 



|C 12 | 2 + (Afi - APof |C, 2 | 2 + (AP + AP o y 
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The wave corresponding to x x varies with z as exp { - j(p 0 + A P)zj and that 
corresponding to x 2 varies as exp {-j(p a -Ap)z}. As a result, a(z) at 
arbitrary z becomes 

a(z)= feS] =A » ic22|2e '"" ! 

e~ jApz 

\ c u\ 2 + (Ap — Ap 0 ) 2 
e iJ P z 

I 2 1 2 + (AP + Ap o ) 2 j 
Therefore, a, (z) is given by 

^o|C 12 | 2 e~>^ 

x |C 12 | 2 (e ~ 2 ^ + e ^) + (AP + Ap 0 f e~ J »' + (Ap - Ap 0 f e**"' 
{|C 12 | 2 + (AP - Ap o y } {|C 12 | 2 + (Ap + Jptf}~ 

Using (6.43) the denominator and numerator can both be simplified to 
4 A/S | C l2 ( and 4 A jl 2 cos A/Sz — 4/ A[S Ap 0 sind/lz, respectively, and a. (z) 
becomes 

a x (z) = A 0 e~ jpoz {cosAp z - j(ApjAp ) sin Ap z} 

To interpret this expression, we multiply by jle*"' and take the real part: 
Re{V2 a 1 (z)e Jmt } 

= \/2^ 0 {1 ~ (\C i2 \/AP) 2 sin 2 Ap z} 1/2 cos(cui — p 0 z — (p t ) (6.44) 
where (p Y is defined through 

tan<p, = (ApJAP) tan Ap z (6.45) 

Similarly, for a 2 (z) we obtain 

Re{^2u 2 (z)e J<ot } = y/2A 0 (\C 12 \/Ap) sin Ap z cos(cot ~ p 0 z - (p 2 ) (6.46) 

where cp 2 is defined through 

c *2 = |C I2 | e~ JV2 (6.47) 

We see from (6.44) and (6.46) that both waves a x (z ) and a 2 (z) propagate 
with the same phase constant p 0 in the positive z-direction, but their magni- 
tudes fluctuate with z. Since the power carried by each wave is given by the 
sqaure of the magnitude, it also fluctuates with z as shown in Fig 6 3 
When ^o = 0,|C 12 |W=l,|a 1 (z )| 2 = 0, and \a 2 (z)\ 2 = A 0 2 at ' z = 
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Fig. 6.3. Exchange of transmission power between two coupled modes. 



{(2/t-l) till Ap). Furthermore, at z=(mt/Ap), \a x { 2 )\ 2 returns to A 0 2 and 
\a 2 (z)\ 2 becomes zero, where n is an arbitrary integer. In other words, 
when the original unperturbed waves have the same phase velocity, the 
power introduced into one wave at z = 0 is completely transferred to the 
other wave and back again by the time it reaches z = ( n/2 Ap) and z = (n(AP), 
respectively. This exchange of power continues back and forth between the 
two waves until the coupling is removed at some point along the trans- 
mission system. 

Let us next consider the case where the total transmission power is given 
by ^(z)! 2 — \a 2 (z)\ 2 . In this case, P = diag[l - 1], and 



c - MA> + Ap 0 ) 0 

°'l 0 j(P 0 - Ap 0 )\ ’ 




Suppose that \C 12 \ 2 > ( Ap 0 ) 2 y then from (6.40), we have 

« = {|C 12 | 2 -Wo) 2 } 1/2 (6-48) 

where Ay t is indicated by ±a. The eigenvalues of C 0 + C t are given by 




a +JPo an d — a +jPo- The corresponding eigenvectors are calculated to be 



X ' [(«—; JW/C.J’ [(-a-jAM/Ct] 

and Xj and x 2 are given by 

6^12 ( — a +j AP 0 ) "I 

\C l2 \ 2 + (-x+jApJ 2 } 

Cuiz+jApo) 1 
[C 12 J 2 + (a +~[Apo) 2 ] 

If a x (z) = A 0 and a 2 (z) = 0 at z = 0, as in the previous example, we have 



1 [|C, 

2 = [ic, 



|C 12 | 2 



2 | + (— a + j Ap o y 



I C 



1 2 1 



2 1 +(< x +jAp 0 y 



a, (z)' 

« 2 (z). 



= ^o|C I2 | 2 c-^ z 
e _< 



X { 



IC 12 I 2 + (— a + j Ap 0 ) 2 1 \C 12 \ 2 + (<x + j Ap o ) 



\2 A 2j 





Fig. 6.4. Growing transmission power. Note that \ai (z)| 2 — |a 2 (z)| 2 is kept constant. 
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To interpret this result, we take the real parts of yj2e itot times a t (z) and 
a 2 (z): 

Re {>/5a , (z) e io>t } = yftA 0 {\ + (|C 12 [/of) 2 sinh 2 az} 1/2 cos (vt - (S 0 z - <p { ) 

(6.49) 

Re ( A /2a 2 (z) e ja> ‘} = - sj2A 0 (\C 12 \fa ) sinhaz cos (ot - p 0 z - <p 2 ) (6.50) 

where (p x and (p 2 are defined through 



and 



tan<pi = (Afto/a) tanhaz (6.51) 

C* 2 = |C 12 | e- J » (6.52) 



Although the total power \a x {z)\ 2 -\a 2 {z)\ 2 is kept constant, the trans- 
mission power carried by a x (z) alone increases with increasing z as shown 
in Fig. 6.4. If the coupling is removed at some point z, and a x (z) is fed into 
a load, the output power becomes (1 + (|C, 2 |/a) 2 sinh 2 az} times the input 
power introduced at z = 0. This type of power amplification by wave 
interaction can be realized in traveling wave tubes, which we shall discuss 
in Chapter 8. 




6.4 Periodic Structures 

In this section, we shall study spatially periodic structures, such as the 
one illustrated in Fig. 6.5. We can consider each period of the structure as a 't 

cavity and the whole structure as a chain of identical cavities. The electro- -j 

magnetic field in each cavity interacts with those in the adjacent cavities in 
a complicated manner; however, a systematic study becomes possible if use 4 

is made of an eigenfunction approach. Although the input and out put $ 

waveguides are aligned in Fig. 6.5, this is not essential in the present discus- £ 

sion. Here, all that is required is periodicity; the cavity between reference i 

Jr 
• \ 

REFERENCE PLANE i 




Fig. 6.5. A periodic structure. 
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planes 1 and 2 is identical with the cavity between 2 and 3, and so forth. 

The transverse electric and magnetic fields at reference planes 1 and 2 can 
be expanded in terms of the waveguide eigenfunctions: 

E||(l) = EE Jn F B (l), E„ (2) = E tn V n (2) 

H[ f (1) = Z k x E,„/„(l), H y (2) = £k x E,„/ n (2) (6,53) 

where V„(i), I n (i ) (/ = 1 , 2) are the expansion coefficients at reference plane /. 
Instead of taking an infinite number of terms with respect to n in the above 
summations, let us terminate at the Ath term, then the field expressions 
become approximations. Since these approximations can be made as good 
as we desire by choosing sufficiently large N, a sacrifice in accuracy need 
not be considered in the following discussion. The transverse electric and 
magnetic fields are now uniquely determined by v l5 v 2 , i, and i 2 which are 
defined by 




The sign of i 2 is reversed so that it represents currents flowing into the next 
cavity. Suppose v t and v 2 are somehow given, then the transverse electric 
fields at the input and output reference planes are fixed, and from them the 
electromagnetic field in the cavity can be calculated, in principle, as we did 
in Chapter 4. The transverse magnetic fields at the reference planes, and 
hence q and i 2 can therefore generally be determined. Assuming the linearity 
of the field relation, this fact can be expressed in the form 

»i = Y n v i + Y 12 v 2 , -i 2 = Y 21 v, + Y 22 v 2 (6.55) 

These equations indicate that if Vj and v 2 are given, in and i 2 can be deter- 
mined. However, the expressions in (6.55) are not convenient for deriving 
an eigenvalue problem for the study of periodic structures since quantities 
at two different reference planes appear on both sides. We would like to 
place quantities related to one reference plane on one side and those related 
to the other reference plane on the other side. The cavity would then be 
represented by a matrix which transforms a vector representing quantities 
at one plane to another vector representing the same quantities at the other 
plane. Hence, n identical cavities in cascade could be represented simply by 
the nth power of the matrix; this is accomplished by defining a transfer- 
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matrix through the equation 



w 2 = Twj 



(6.56) 



By substitution, we see that 

t 11 = -y 1 - 2 1 y 1 



T 2 i = — Y 21 + Y 22 Y72 Yj 



= f T n T12I 

' [t 21 t 22 j 



T 12 = Yr 2 1 

T 22 =-Y 22 Y7 2 1 



(6.57) 



where an assumption is made that Y 12 is nonsingular. 

T is a square matrix of order IN, and hence it is expected that there are 
2 N independent eigenvectors ; any vectors w can be expressed by their linear 
combination. Unless some restriction is imposed on T, no fruitful discussion 
of the eigenvalue problem is possible ; therefore, let us consider reciprocal 
and lossless conditions either of which can be used in the present discussion 
in the same way as (6.7) was used previously. 

The reciprocal condition in terms of T is given by 



where 



T,RT = R 



-C V] 



(6.58) 



(6.59) 



The proof is as follows. Rewriting (5.70) in terms of v u v 2 , i x , and i 2 , we 
have 

V <1):(2) _ v (1) i (2) - V <2 >i (1) 4- V (2) i (1) - 0 
Vlf 1 ! _ ' v 2r 1 2 v lr *1 + V 2J *2 

Using (6.56) and (6.57), this can be expressed in terms of and only, 

W - WI’T,,, + it;*,*) (T 21 v' 2) + T 22 i ( 1 2> ) 

- v'.W + (v (2, T ui + i‘?’T 12 ,) (Tirf 1 + T 22 i‘‘>) = 0 

which is equivalent to 

^'(I-T^T^ + T^T^iP 

-vff(I-T 1 „T 22 + T 21 ,T 12 )i« , > 

— — T 211 T 11 )v ( 1 2 > 

-i'i 1 , , ('I',2,T 22 -T 22 ,T 12 )i < l 2| = 0 

Noting that v ( j U , i^, v < 1 2) , and i <2) are arbitrary, we obtain from this 
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relation 

l-T 1Ir T 22 +T 2If T 12 =0 
Tn»T 2 i — T 21f T n =0 
^12^22 — T 22 ,T 12 = 0 

This completes the proof, since (6.58) is equivalent to these three equations. 
The lossless condition in terms of T is given by 

T + LT = L (6.60) 

where 

L =[i 0] < 6 - 61 > 

The proof is as follows. If the circuit is lossless, the net power flowing into 
it must be equal to zero, i.e., 

i {(*1 +v i + v i +i i) - («2 + v 2 + v 2 + i 2 )} = 0 

In terms of and w 2 , this can be rewritten in the form 

Wj + Lw! — w 2 + Lw 2 = 0 

which is equivalent to 

w 1 + (L-T + LT)w 1 =0 

where use is made of (6.56). Noting that this relation holds for arbitrary 
w l5 we see that (6.60) is therefore proved. 

We are now in a position to discuss the eigenvalue problem of T, following 
the discussion given in Section 6.1. Let us first use (6.58). 

The eigenvalue problem of T is given by 

(T — yl) x = 0 (6.62) 

The eigenvalues of T are obtained by solving the algebraic equation for y, 

det (T - yl) = 0 (6.63) 

If all the roots of this equation are distinct, Theorem I, in Section 6.1, and 
its proof holds for the eigenvectors of T without modification. 

Corresponding to Theorem II, we find that if y k is an eigenvalue, y^" 1 is 
also a eigenvalue. This can be proved as follows. First, we note that T is 
nonsingular and T 1 exists since det T ^ 0 as we can see by taking the 
determinant of (6.58) and using det R^0. Applying T -1 from the right, 
(6.58) becomes 



T,R = RT 



1 



(6.64) 
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Using this equation, we obtain the following relation from (6.63) 

det(T t — y fc I) detR = det(T,R — y t R) 

= det(RT _I - y k R) = detR det(T _l - y*I) = 0 

Since detR # 0, we have 

det(T~ 1 — y k I) = 0 

or equivalently, 

det(T-y k - , I)=0 

which shows that y k 1 is also an eigenvalue of T. 

Corresponding to Theorem III, we find that if y k and y, are the eigenvalues 
corresponding to eigenvectors x k and x h respectively, and if y k ^ yf 1 , then 



x kt Rx, = 0 

This can be shown as follows. Eigenvectors x k and x, satisfy 



(6.65) 



Tx k = y k x k , Tx, = y,x. 



which are equivalent to 



X/u T, = Yk*k < , T 'x i = y l 1 x, 

respectively. Multiplying the first equation by Rx, from the right and the 
second equation by x kl R from the left, we obtain 

x* t T f Rx, = y k x*,Rx f , x k ,RT “ 1 x, = y “ ' x kt Rx, 

The left-hand sides of these two equations are equal from (6.64); therefore 
we have 

(y k -y^)x kt Rx, = 0 

By hypothesis y k ^ y, -1 and (6.65) is therefore deduced. 

Finally, corresponding to Theorem IV, we find that if y k = y[\ then 



x k( Rx, * 0 



( 6 . 66 ) 



From the first theorem, (Rx f ) + can be expressed as a linear combination of 
eigenvectors. 

( Rx ,) + = X a m X mt 

where the a m ’s are the expansion coefficients, and the eigenvectors are all 
transposed. Multiplying by Rxj from the right, we have 

(Rx,) + (Rx,) = X « m x ml Rx, 
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All the terms on the right-hand side disappear except the fcth one, according 
to the previous theorem (6.65). The A:th term cannot be equal to zero since 
the left-hand side is nonzero; this is impossible unless (6.66) holds, and the 
proof is therefore complete. 

Let us define x k by 

x* = cx kt (6.67) 

when y k is either 1 or — 1, but when y* 2 # 1, then 

x k = cx lt (6.68) 

where x, is the eigenvector corresponding to y, = y* 1 whose existence is 
guaranteed by the second theorem. By selecting c properly, let 

x k Rx k = 1 (6.69) 

If / ^ k, we have from (6.65) 

XjRx* = 0 (6.70) 




The above discussion is based on the assumption that the 2 N roots of 
(6.63) are all distinct. When (6.63) has multiple roots, we can still obtain 2 N 
independent eigenvectors using a discussion similar to that presented in 
Section 6.1. Therefore, an arbitrary vector x in the 2 N dimensional space 
can be expressed in the form 



X = Z x k (x fc Rx) 


(6.71) 


where the summation is over k from 1 to 2 N. Multiplying by T from the 


left, we have 




Tx = Z T*x k (x k Rx) 


(6.72) 


The spectral representation of T is given by 




T * = Z 7 *x k x*R . 


(6.73) 


and that of T" by 




T • = Z 7* n x*x k R • 


(6.74) 



The transverse electric and magnetic fields corresponding to an eigen- 
vector of T are multiplied by the corresponding eigenvalue each time the 
reference plane is shifted one period in Fig. 6.5. Therefore, the electro- 
magnetic field inside the «th period must be the nth power of the eigenvalue 
times the electromagnetic field inside the first period. Each independent 
electromagnetic field pattern with this property is called a mode of the periodic 
structure. It follows from the above discussion that any electromagnetic 
field in a periodic structure with reciprocity can be expressed as a linear 




280 



6. COUPLED MODES AND PERIODIC STRUCTURES 



combination of the modes to any desired accuracy, assuming the esixtence 
of the transfer matrix. 

Let us next use (6.60) in place of (6.58) to reach a similar conclusion. 
Assuming that the 2 N roots of (6.63) are distinct, Theorem I holds without 
modification. Corresponding to Theorem II, we find that if y k is an eigen- 
value, (y k *) -1 is also an eigenvalue of T. To prove this, we note that T is 
nonsingular from (6.60), and hence the lossless condition can be written in 
the form 

T + L = LT -1 

From this and (6.63), we obtain the following relation. 

det (T + - y*l) detL = det(T + L - y k *L) 

= det (LT - 1 - y k *L) = det L det (T - 1 - y k *I) = 0 

Since detL ^ 0, this shows that y fc * is an eigenvalue of T -1 , or equivalently, 
(y**) -1 is an eigenvalue of T. Theorems III and IV become 

X i + Lx k = 0 (y -1 #y fc *) 

X, + Lx*#0 (y -1 =y fc *) 

The proofs are similar to those for the previous case, and we shall not 
repeat them. Once these relations are obtained, we can define x k by 

x * = cx k + (|y*| = 1) (6.75) 

X* = cx ; + (|y*| =£ 1) (6.76) 

where x, is the eigenvector corresponding to y, = (y fc *) -1 , whose existence 
is guaranteed by the second theorem. Let us choose c properly so that 

x k Lx k = 1 (6.77) 

Then, using a discussion similar to the one given in Section 6.1, we have 

x = £x k (x*Lx) (6.78) 

regardless of whether or not some of the eigenvalues are degenerate. The 
spectral representation of T is given by 

T • = Z 7fcX k x*L • (6.79) 

Modes in a lossless periodic structure can be defined in the same way as 
for the reciprocal case. We, therefore, conclude that any electromagnetic 
field in a lossless periodic structure can be expressed as a linear combination 
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of the modes to any desired accuracy, assuming the existence of the transfer 
matrix. 

Each mode can be excited separately by generating a proper field pattern 
at a reference plane. Suppose that only one mode k is excited, then the surface 
integral j E x H* «( — n) dS over reference planes 1 and 2 (n points out- 
wards) becomes 

Oi +v i -i 2 + v 2 ) = ii +v 1 (i -w**) 

When |y k | 2 = 1, no attenuation takes place between one reference plane and 
the next one, and the mode is said to be in the passband. In this case, the 
above surface integral vanishes and (2.84) shows that the average electric 
and magnetic energies stored in one spatial period are equal, assuming 
nonzero co. When )y fc | 2 ^ 1, since y k ^ (y fc *) -1 , we have 

x k + Lx k = 0 

or equivalently, 

[¥ * +i ‘ +] [i o] 

Therefore, no real power is transmitted, and the mode is said to be in the 
stopband. 

Let us next apply (5.96) to one spatial period in Fig. 6.5. To do so, we 
assume both the reciprocal and lossless conditions. For the &th mode, the 
left-hand side becomes 



dv, di, 
1 dco 1 dco 



. + dv 2 + di 2 

u V, 

dco dto 





(1 -y**yft) + (ii + v 1 + Vi + ii) 




In the passband, the first term on the right-hand side vanishes, since |y k | 2 = 1, 
and (5.96) reduces to 

- y* (dy Jdco) = i/P -1 f (fiH* • H + eE* • E) dv (6.80) 



where P is the transmission power given by 



P = Re{i 1 + v 1 } 



Let L be the length of one period and /? be defined by 

y* = e xp (-jfiL) 
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Then (6.80) becomes 

(dPjdoj) = iP~ l r l J (fill* • H + eE* • E) dv (6.81) 

Comparing this with (3.23) in the discussion of waveguides, we see that the 
left-hand side corresponds to the reciprocal of the group velocity. From this 
observation, we may say that the group velocity of a mode in the passband 
is equal to the transmission-power times the length of one period divided 
by the electromagnetic energy stored therein. 

In a lossless, reciprocal, periodic structure, if y k is an eigenvalue, y k 1 
and (y fe *) _1 must also be eigenvalues. If y„ is not real and |y*| ^1, then at 
least four different eigenvalues y k , y k *, y k \ (y**) -1 have to exist. When we 
take into account only one E,„ in our calculation, as we shall do in the next 
section, T becomes a two-by-two matrix, and hence only two eigenvalues 
can exist. In this case, either the eigenvalues have to be real or their magni- 
tude is unity, otherwise four different eigenvalues are required. 

6.5 oj-fl Diagram 

Let us consider a waveguide with identical windows periodically spaced 
as shown in Fig. 6.6, and suppose that only one waveguide mode is necessary 
to approximate the fields at the reference planes located midway between 
any two adjacent windows. Let b 0 represent the normalized susceptance of 
the window with respect to the waveguide admittance. Then an equivalent 
circuit representing one period of the structure becomes a transmission line 
with electrical lenght fi 0 L shunted by an admittance jb 0 Y 0 at the midpoint 
as shown in Fig. 6.7. The term p 0 is the phase constant and Y 0 is the charac- 




Fig. 6.6. Waveguide with periodically spaced windows. 



ip* 
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Fig. 6.7. Equivalent circuit for one section of the periodically loaded waveguide. 



teristic admittance of the waveguide mode under consideration, and L is 
the length of one period. 

The voltage and current at port I can be expressed in terms of the incident 
and reflected waves A and B (see Section 3.5) as follows: 

V t =A + B, I V = Y 0 {A-B) (6.82) 

The voltage and current at the left-hand side of jb 0 Y 0 are given by 

V = Ae~ J$ + Be j \ I = Y 0 (Ae~ j0 - Be je ) (6.83) 

where 9 = p 0 L/2. The voltage and current at the right-hand side of jb 0 Y 0 
become V and I—jb 0 Y 0 V, respectively, since current jb 0 Y 0 V is diverted 
into the shunt admittance. In terms of the waves A' and B' on the right-hand 
side of the window, V and I—jb 0 Y 0 V must be expressible in the forms 

V = A' + B’, I — jb 0 Y 0 V — Y 0 (A' — B') (6.84) 

and the voltage and current at port II are given by 

V 2 = A'e~ j0 + B'e j9 , I 2 = Y 0 (A'e~ je - B ,j0 ) (6.85) 

Note that the direction of I 2 is chosen to be outward so as to conform with 
the sign of i 2 in Section 6.4. 

Eliminating V and / from (6.83) and (6.84), A' and B' can be expressed in 
terms of A and B. Substituting this result into (6.85) and then using (6.82), 
we can write V 2 and / 2 in terms of V 1 and I t . The result is given by 






284 6. COUPLED MODES AND PERIODIC STRUCTURES 

where T is a square matrix of order 2 with the following components: 

Tji = T 2 2 = cos p 0 L — \b 0 sin p 0 L 

T l2 =jYo l (lb 0 - sin p 0 L - $b 0 cos p 0 L) (6.87) 

T 2l = ~jY 0 (±b 0 + sin p o L + \b Q cos p 0 L) 

The two eigenvalues of T, y t and y 2 , are obtained by solving the quadratic 
equation det (T — yl) = 0. 

y, >2 = cos P 0 L~ jb 0 sin p 0 L + {(! b 0 f - (sin p 0 L + $b 0 cos p 0 L ) 2 } 1/2 

(6.88) 

When 

{jb 0 ) z < (sin p o L + \b 0 cos p 0 L) 2 (6.89) 

the eigenvalues become complex and their magnitudes are found to be 
unity, indicating that the corresponding modes are in the passband. On the 
other hand, if 

(ib 0 ) z > (sin p 0 L + ±b 0 cos p 0 L) 2 (6.90) 

both of the eigenvalues are real and their magnitudes are not equal to 
unity, which means the corresponding modes are in the stopband. 

If the opening of a waveguide window is small, b 0 is large and the passband 
condition (6.89) is satisfied only when |cos/? 0 L| is close to unity; or equiva- 
lently, when the length of one period of the structure becomes approxi- 
mately an integer multiple of a half- wavelength, thereby forming a resonator. 
When the length deviates from such a value, the sections between two adja- 
cent windows cease to resonate, and no transmission of power becomes 
possible thus resulting in a stopband. Similarly, if b 0 is small, the stopband 
condition (6.90) is satisfied only when the length of one period becomes 
approximately an integer multiple of a half-wavelength. In this case, although 
the reflections from individual windows are small, the reflections from suc- 
cessive windows arrive back in phase and the overall result is no trans- 
mission of power. When the length of one period deviates appreciably from 
an integer multiple of a half-wavelength, the small reflections do not add in 
phase and introduce only a minor modification in the wave propagation. 

Let us now concentrate on the passband. Since one of the eigenvalues is 
the complex conjugate of the other and their magnitudes are unity, y t and 
y 2 can be expressed as 

yi=e~ jpL , y 2 = e jpL (6.91) 

where the phase shift from one reference plane to the next is indicated by 
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PL. Substituting (6.91) into (6.88), we obtain 

cos PL = cos p 0 L — \ b 0 sin p 0 L (6.92) 

from which we learn how pL changes with a). 

From (3.127), the relation between p 0 and co is given by 

Po 2 = ofepi - k 0 2 

where k Q 2 is the eigenvalue of the waveguide mode. Therefore, co as a 
function of p 0 L should appear as the broken line in Fig. 6.8(a). If b 0 is a 
slowly varying function of co, the right-hand side of (6.92) as a function of 
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P 0 L may, look like the broken line in Fig. 6.8(b). The solid line indicates 
cos pL as a function of pL. For a given o), P 0 L is determined from Fig. 6.8(a), 
and hence the corresponding value of pL can be found from Fig. 6.8(b) 
following the dotted lines with arrowheads. Thus, co versus pL should 
appear as the solid lines in Fig. 6.8(a). Note that the broken line a> versus 
P 0 L and one of the solid lines w versus pL intersect at PL = p o L = nn 
( n : integer). Also note that the tangent to the solid lines becomes horizontal 
at pL = nn, indicating that the group velocity becomes zero at the corre- 
sponding co's. Those ranges of co for which there is no corresponding pL 
in Fig. 6.8(a) represent the stopbands. 

A graph which shows o versus p in a similar way is called the a>-p diagram 
for the periodic structure. From the above discussion, we see that the general 
appearance of the co-P diagram for a periodic structure should like look 
Fig. 6.9. Each of the curves extends from P= — oo to +co as a periodic 
function of p with a period 2njL. Since PL and PL + 2 nn give the same 
eigenvalues in (6.91), one period of the curves, say from p = — n/L to n/L, 
contains all the information about the eigenvalues, and the remaining 
periods are redundant. The particular range of p mentioned above is called 
the first Brillouin zone. As far as the phase shift of the electromagnetic field 
at the reference planes is concerned, only the first Brillouin zone is of interest. 
The situation is slightly different, however, if we are interested in the more 
general problem of the fields throughout the waveguide. Let us consider 
the electric field, first concentrating on the mode corresponding to y,. Each 
time we move to the next section of the periodic structure, the phase of the 
electric field changes by pL since the phases of the electromagnetic fields at 
the boundary reference planes shift the same amount. Thus, we have 

E(x, y, z + nL) = e~ JpnL E(x, y, z) 

where the z-axis is taken in the longitudinal direction of the waveguide. 
The above expression is equivalent to 

e jPz E(x, y, z) = e ip(z+nL) E(x, y, z + nL) 

which indicates that e Jpz E(x, y, z) is a periodic function of z with period L. 
Using a Fourier series expansion, it is expressed in the form 

e jp7 E(x, y, z) = £ A„(x, y) exp(j2nnz/L) 

n= - oo 
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or equivalently, we have 

E (x, y, z) = X A„(x, y)exp[-j{P-(2nnlL)}z~\ 

n= - <x> 

In other words, the electric field consists of many components, each with 
the same group velocity but different phase velocities; the velocity of the 
nth component is given by 

v p = co{P-(2nn/L)}- 1 

The «th component is sometimes called the «th space harmonic. The phase 
velocity appears as the slope of the dotted line drawn from the origin to the 
corresponding point on the co-P diagram as shown in Fig. 6.9. Depending 



U) 




Fig. 6.9. An example of an co-P diagram. 

on the value of n, the phase velocity becomes very small, and for some com- 
ponents it may even become negative; these components appear to travel 
in the direction opposite to the power transmission. The magnetic field cor- 
responding to y 1 and the electric and magnetic fields corresponding to y 2 
can be discussed in a similar manner. 

In many electron beam devices, such as the traveling wave tube which we 
shall discuss in Chapter 8, the necessary interaction between the circuit and 
the electron beam requires a slow electric field component traveling with a 
velocity approximately equal to that of the electrons. Since various periodic 
structures are used to obtain such a slow component, periodic structures 
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are often called slow- wave structures. In the traveling wave tube, the structure 
must be designed so as to make the z-component of A„ (x, _p) at the electron 
beam as large as possible and the other field components as small as possible 
in order to minimize unnecessary losses. It should be noted that A„(x, y) 
has the proper phase velocity. 

In the above analysis, an assumption was made that only one waveguide 
mode was necessary to approximate the fields at the reference planes. As co 
increases, however, more and more waveguide modes start to propagate and 
sooner or later the assumption becomes invalid. Several propagating modes 
may interact with each other in a complicated manner, and the co-fi diagram 
fails to resemble the one derived in this section. To avoid such a complicated 
situation, the operating frequency is generally kept low and/or the structure 
is made to satisfy certain symmetry conditions under which the excitation 
of higher order propagating modes is prohibited. 



PROBLEMS 

6.1 Following the argument in Section 6.3., discuss the exchange of transmission power 
between the two coupled modes under the condition that ai(z) = Ao and 02 ( 7 ) = 
— Ao at z = 0. 

6.2 In the above problem change the condition at z = 0 to ai(z) = at(z) = Ao. 

6.3 Give an example of a periodic structure without Yy (ij = 1,2). Also give an example 
with Yy but with a singular Y12. 

6.4 When only one propagating mode is excited in a periodic structure, the average 
electric and magnetic energies stored in one period are equal to each other as shown 
in Section 6.4. Investigate the cases in which two modes are excited, and give an example 
in which the electric and magnetic energies stored in one period are not equal to 
each other. 

6.5 Prove that when several modes are excited in a lossless periodic structure, the total 
transmission power is equal to the sum of the transmission powers due to the 
individual modes. 

6.6 Suppose that each period of a periodic structure has a plane of symmetry at its 
midpoint and obtain the condition that must be satisfied by T. Also investigate what 
kinds of relations exist among the eigenvalues. 

6.7 Construct the co-fi diagram for an ordinary transmission line (TEM mode) having 
identical shunt capacitances periodically spaced. 
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CHAPTER 7 

LINEAR AMPLIFIERS 



In many practical applications, a signal level may be too low to perform a 
desired operation without introducing excessive noise or distortion. In such 
cases the signal is often amplified to a more useful level. To indicate the 
amount of amplification introduced by the amplifier, we define the trans- 
ducer gain as the ratio of the actual signal power absorbed in the load to the 
available signal power from the generator. An amplifier is supposed to have a 
transducer gain larger than unity. When the transducer gain is large over a 
wide frequency range, the amplifier is described as having a broadband and 
high gain. 

If excessive noise or distortion is introduced during the amplification 
process, the amplifier may become useless, even though it provides high gain. 
Consequently, the noise performance and distortion are considered to be 
important factors in determining the quality of amplifiers. In this chapter, 
we shall concentrate mostly on the amplifier noise performance, assuming 
that the input-output relation is linear, i.e., neglecting any possible distortion. 
This does not mean that distortion is unimportant; on the contrary, in many 
cases it becomes a vital factor. Unfortunately, no unified theory of amplifier 
distortion is available in a form suitable for presentation here. 

The conventional theory of amplifier noise performance almost exclusively 
express results in terms of the available gain, which is defined as the ratio of 
available signal power at the amplifier output port to the available signal 
power from the generator. This quantity is convenient because the available 
gain of a cascade amplifier is equal to the product of the available gains of 
the component amplifiers. In addition, the discussion of noise performance 
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of the cascade amplifier can be made particularly simple since the use of 
available gain conceals the real complexity which exists. On the other hand, 
when the gain of practical amplifiers is discussed, it is generally in terms of 
their transducer gain. If the available gain is used, the available signal power 
at the output port is of primary interest rather than the actual signal power, 
and hence the user is left with the burden of providing a proper load im- 
pedance to utilize it; this often turns out to be the most difficult part of the 
whole operation. Both the generator and load impedances are therefore 
specified beforehand, and the amplifier is designed to deliver the amplified 
signal to the specified load impedance, i.e., to provide a proper transducer 
gain. Usually, the generator and load impedances are equal to the charac- 
teristic impedances of the transmission lines at the amplifier input and 
output ports, respectively. Under these circumstances, if we develop the theory 
of amplifier noise performance with emphasis on the available gain, many con- 
fusing or misleading conclusions are reached. For this reason, we shall follow 
an alternative course and develop a theory based on the transducer gain. 

We first derive an equivalent circuit for linear noisy multiport networks. 
The noise performance of amplifiers is discussed next, considering an ampli- 
fier as a two-port, linear, noisy network with a transducer gain larger than 
unity. The unconditional stability conditions and the unilateral gain are 
also presented. The last three sections are devoted to the discussion of 
tunnel-diode and parametric amplifiers that illustrate the kind of noise 
performance to be expected from these practical devices. 

7.1 Canonical Forms 

It was shown in Section 5.3 that the exchangeable power of a generator 
is invariant to lossless nonsingular transformations. A different proof of 
this theorem will now be given, using the impedance matrix of a lossless 
two-port network since it will set a pattern for deriving invariants of n-port 
noisy networks. 

In the circuit shown in Fig. 7.1, the exchangeable power of the generator 
connected to port I of the two-port network is given by 

P 0 = |E| 2 /(4ReZ,) (7.1) 

and the relation among V u V 2 , I u and I 2 is given by 
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Fig. 7.1. Lossless transformation of a one-port generator. 



Since V t is equal to E— Z,/,, we have from (7.2) 

V l =Z ll I 1 +Z t2 I 2 =E-Z i l l (7.3) 

Assuming that Z x x + Z, is not equal to zero, l x is obtained in terms of E 
and I 2 , 

i i ={z li + z i y 1 (E-z l2 i 2 ) 

Substituting this into (7.2), we have 

V 2 = Z 2l I 1 +Z 22 I 2 = Z 2l (Z i + Z 11 ) 1 E + {Z 22 — Z 12 Z 2I (Z x + Z u ) 1 } I 2 

(7.4) 

Comparing this with (7.3), and remembering that the positive direction of 
I 2 is inward, the open-circuit voltage and the output impedance looking 
back at port II are given by 

Z 21 (Z i + Z n r‘E and {Z 22 - Z 12 Z 21 (Z, + Z u )~ >) 
respectively. The exchangeable power at port II therefore becomes 
Pi = |E| 2 |Z 21 | 2 | Z, + Z u \~ 2 / 4 Re {Z 22 - Z 12 Z 2 , (Z, + Z u )"} (7.5) 



the denominator of which can be rewritten in the form 



4 Re (Z 22 — Z 12 Z 21 (Zj + Z n ) 1 } 

-~>\y | T* Z I2 Z 21 Z 12 Z 21 ] 

1” + Z 22 - (7.6) 

When the two-port network is lossless, the impedance matrix Z must satisfy 
the lossless condition 

Z + Z + =0 

or equivalently. 



- “Zu, Z, 2 — — Z 21 , 
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With these relations, the right-hand side of (7.6) reduces to 
IZ 21 | 2 (Z, + Z 11 +Z,* + Zt,) 



IZ. + Z..I 



= |Z 21 | 2 |Z, + Z 11 r 2 4ReZ i 



Replacing the denominator of (7.5) by this expression, we have 

iV = |E| 2 /(4ReZ,) = f> 0 (7.7) 

which shows the exchangeable power is invariant to the lossless transfor- 
mation. It is worth noting that we assumed nonzero Z n + Z, and nonzero 
Z 2 1 in the above calculation. The assumption of nonzero Z 21 was used to 
avoid making (7.5) indeterminate. 

Since the exchangeable power has been defined only for one-port generators, 
a question arises whether or not there are some quantities for an «-port 
generator which are invariant to lossless transformations and hence corre- 
spond to the exchangeable power of a one-port generator. To answer this 
question, let us study a lossless transformation of a linear n-port generator 
following the pattern set by the above discussion. 

Some voltages may appear at each port of the w-port generator when all 
the ports are open circuited. Let E t , E 2i ..., E n be the voltages where the 
subscripts refer to the port numbers. We define e by 



e = 




(7.8) 



Referring again to the discussion concerning an equivalent circuit for one- 
port generators in Section 1.4, we see that the relation between the voltages 
and current at the ports is given by 



v = Zi + e (7.9) 

where v is the voltage vector, i the current vector, and Z the internal imped- 
ance matrix of the n-port generator. An equivalent circuit for (7.9) is shown 
in Fig. 7.2. 

Since we used a two-port lossless network to transform a one-port 
generator, we should consider using a 2«-port lossless network to transform 
an n-port generator. Taking the voltage and current directions for the 2 n- 




% 




h 

$ 




Fig. 7.2. Equivalent circuit of an «-port generator. 

port network as shown in Fig. 7.3, we have 

V al _ Y^aa Zab] 

. V 6J \7 J ba 

where v a and v b are the voltage vectors, while i a and i b are the current 
vectors at reference planes a and b, respectively. The terms Z aa , Z ab , Z ba , 
and Z bb are all square matrices of order n, and together they form the 
impedance matrix of the 2n-port network. Since v = v a , we have from (7.9) 
and (7.10) 

Zi + e = Z aa i a + Z ab i b 



L*6J 



(7.10) 




Fig. 7.3. Lossless transformation of an w-port generator. 
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Keeping in mind that i and i a are equal, except for their directions being 
opposite, we obtain from the above equation 

l a — ( z + z J 1 (e - ZJ fr ) 

where an assumption is made that (Z + ZJ is nonsingular. Substituting 
this into (7.10), v b can be expressed in terms of e and i b ; 

v 6 = 'Z j ba \ a + Z bb i b 

= {Z» - Z ba (Z + ZJ-* Zj i 6 + ZJZ + ZJ' 1 e (7.11) 

Let e' be the open-circuited voltage vector and Z' the internal impedance 
matrix observed at reference plane b , then we have from (7.11) 

e' = ZjZ + Z J-*e (7.12) 

Z' = Z M -Z 6a (Z + Zj- , Z at , (7.13) 

The exchangeable power of a one-port generator is given by the square of 
the absolute value of the open-circuited voltage divided by four times the 
real part of the internal impedance. Corresponding to the real part of the 
internal impedance, we should take one half of Z-l-Z + , the sum of the 
impedance matrix and its adjoint. Corresponding to the square of the 
magnitude of the open-circuited voltage, \E\ 2 = EE*, we may consider 
either ee + or e"e; however, since e + e becomes a matrix of order 1 x 1, or 
equivalently, a scalar quantity, it does not match the above matrix form of 
the impedance. Thus, ee + , a square matrix of order n, seems to be a natural 
choice. In following sections, we shall study the case in which the £j’s 
represent noise voltages, and hence the ensemble average <£,£/> is more 
meaningful than the particular value of Efij* itself. For this reason, let us 
choose <ee + >, whose ij component is (E i E*'), instead of ee + itself. 
Corresponding to the fact that |£| 2 is either positive or zero, <ee + > is 
either positive-definite or positive-semidefinite. This can be seen from 

<x + e(x + e) + > = x + <ee + > x ^ 0 (7.14) 

where x is an arbitrary fixed vector and the angle brackets < ) indicate the 
ensemble average of the quantity within. 

We are now in a position to investigate with the help of (7.12) and (7.13) 
how <ee + > and Z+Z + are transformed by the lossless 2«-port network. 
From the lossless condition, the sum of the impedance matrix of the 2 n- 
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port network and its adjoint must be zero; 




or equivalently, 

Z aa = ~ Z+a , Z ab = — Z ba , Z bb = — Z bb (7.15) 

From this, we see that Z aa and Z bb are restricted by the condition that their 
adjoint matrices must be equal to themselves except for the opposite signs. 
On the other hand, there is no real restriction imposed on Z ba . Once Z ba 
is given, Z aft is fixed by the middle condition of (7.15); it therefore follows 
that either Z ba or Z a6 can be chosen arbitrarily. 

For convenience, let us define H + as 

H + = Z ba (Z + ZJ -1 (7.16) 

Since Z fta is arbitrary, H 1 can also be chosen arbitrarily by selecting a proper 
lossless network for the transformation. Using H + thus defined, <e'e' + > 
can be written in the form 

<eV + > = <H + e(H + e) + > = H + <ee + > H (7.17) 

where (7.12) is used. This equation indicates how (ee + ) is transformed 
by the 2«-port network. For the transformation of Z + Z + , let us consider 
Z + Z' + . From (7.13), we have 

Z + Z' + = Z bb - Z ba (Z + ZJ" 1 Z ab + Z+ - Z+(Z + + ZJ“ 1 Z l 
Using (7.15), this can be simplified as follows: 

Z + Z >+ = ZJZ + ZJ- 1 Z/ fl + ZJZ + + z J - 1 z; a 
= z ba {(Z + ZJ- 1 + (Z + + zj- 1 } z ; 

= Z 6a (Z + z aa )- 1 {(Z + + Zi) + (Z + Z J} (z + + z J- 1 z i 
= H + (Z + Z + ) H (7.18) 

A comparison of (7.17) and (7.18) shows that <ee + > and Z + Z + undergo 
the same transformation by the lossless 2«-port network. Since (Z + Z + ) + = 
Z + Z + and «ee + » + =<ee~>, it follows that <ee + > and Z + Z + are 
both self-adjoint. Furthermore, <ee + > is either positive-definite or posi- 
tive-semidefinite from (7.14). Let us assume for the moment that <ee + > 
is positive-definite. Then there are two self-adjoint matrices, and one of 
them is positive-definite. Consequently, it is possible to select a nonsingu- 
lar H which diagonalizes both <e'e , + > and Z' + Z' + , simultaneously, as 
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shown by (5.60) and (5.61). The diagonal components of Z' + Z' + are 
given by the eigenvalues of «ee + » _1 (Z + Z + ) which are all real. Let 
A 2 ,..., A„ be the real eigenvalues, then we have 

<e'e' + > = I (7.19) 

Z' + Z' + = diag [A x A 2 ... AJ (7.20) 

In order for H to exist, Z + Z aa has to be nonsingular, and for H to be 
nonsingular, Z ba has to be nonsingular. Since these conditions do not 
contradict the lossless condition given by (7.15), it is always possible to get 
an appropriate lossless 2n-port network which gives (7.19) and (7.20). 

Let us investigate the physical meaning of the result obtained above. It 
follows from (7.20) that the new n-port generator has an internal impedance 
matrix of the form 

Z =^diag[Aj A 2 ... A„] + Z r (7-21) 

where Z r is an n-port impedance matrix satisfying the lossless condition 

Z r + Z, + = 0 (7.22) 

In order to obtain (7.19) and (7.20), we have selected an appropriate H; 
however, since H contains only Z ba and Z aa . , Z bb of the 2n-port network can 
still be selected arbitrarily provided that Z bb + Z bb = 0. Since the subtraction 
of Z r from Z bb does not change this condition, we can always select a 
proper Z bb which makes Z r in (7.21) disappear. Remembering that A l5 A 2 , 
..., A n are all real, this means that by selecting a proper lossless 2n-port 
network the transformed circuit is equivalent to n resistors, each connected 
in series with a voltage source expressed by a component of e'. Since 
<e'e' + ) = I, each voltage source is uncorrelated with the others and has a 
unit magnitude as illustrated in box C on the left-hand side of Fig. 7.4. 
This is called the canonical form of the original n-port generator, and the 
operation performed by the lossless 2n-port network is called the canonical 
transformation. 

The lossless 2n-port network transforms v and i of the n-port generator 
into \ b and i b . Since v = v 0 and i = - i fl , we have 

v = - Z aa i + Z ab \ b 
v b = ~ Z fca i + Zbbh 

From (7.23), if, is given by 



h ~ ^ab"* + ^abZ aa i 



(7.23) 

(7.24) 
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Fig. 7.4. Equivalent circuit of an n-port generator including the canonical form C and 
2n-port transforming network T _1 where T is the canonical transformation. 



where the existance of Z~ b is guaranteed since H, and hence Z ba , is chosen 
to be nonsingular and Z ab = — Z ba from (7.15). Substituting the above 
equation into (7.24), we obtain 

V 6 = Zf^Z^f/v + {f'bb^ab 7'aa ~ ^ba) * 

The relation between the voltages and currents at the input and output ports 
of the 2/i-port network is therefore given by 



v i> = ^bb^ab Z ba — Z bb Z ab Z aa V 
*t>. . — L ab Z ab Z aa — i 



Thus defined, T is the transfer matrix representing the lossless 2/i-port 
network which performs the canonical transformation; T satisfies the loss- 
less condition (6.60), and if T exists, it is nonsingular from (6.60). In general, 
the transformation performed by a circuit with nonsingular transfer matrix 
is said to be nonsingular; therefore the canonical transformation is non- 
singular. 

Multiplying (6.60) by (T + ) _l from the left and by T -1 from the right, 
we obtain 

(T -1 ) + LT _1 = L (7.26) 

A comparison of (7.26) with (6.60) shows that T -1 represents another loss- 
less 2«-port network. Suppose that the original n-port generator is first 
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transformed by T into its canonical form and is next transformed by T -1 , | 

then the resultant network has exactly the same v and i as those of the original f 

/7-port generator. We, therefore, conclude that an arbitrary /7-port generator i 
is equivalent to its canonical form followed by a lossless 2/i-port network 
whose transfer matrix is given by T . | 

The eigenvalues, A ly A 2 ,..., A„, of «ee + » _1 (Z + Z + ) are invariant to a % 

nonsingular, lossless, but otherwise arbitrary, transformation T x . This J 

follows because the cascade connection of two lossless 2/7-port networks * 

with transfer matrices Tj~ 1 and T gives the same canonical form characterized if 

by A ly A 2 ,..., A„. Furthermore, since the canonical form is solely determined I 

by the eigenvalues, there are no invariants other than A iy A 2 , ..., A n and their | 

functions when arbitrary nonsingular lossless transformations are considered. i 

For one-port generators, the exchangeable power is invariant to lossless * 

nonsingular transformations. This is understandable because the exchange- f 
able power is given by l/(2Aj), where A x is the eigenvalue of (<ee + » -1 x f 

(Z+Z + ) which is a matrix of order 1 x 1 in this case. The exchangeable f 

power and its functions are the only invariants to lossless nonsingular . 

transformations. J 

Finally, let us investigate the case in which <ee + > is positive-semi- £ 

definite. In this case «ee + >) -1 does not exist; however, by adding a small I 

voltage vector At to the open-circuited voltage e at the terminals <(e + de) x | 

(e + At) + > can be made positive-definite. For this perturbed /i-port generator, J 

the canonical form can be obtained as we have done above. In this canonical % 

form, A x , A 2 , A n are functions of the small voltages making up the vector 
At. Since the perturbed generator approaches the original generator as At 
approaches zero, let us assume that the canonical form of the original /7-port 
generator is obtained by taking the limit as de-> 0. 

During this limiting process, one or more of A x , A 2 ,..., A n may become 
infinite showing that the values of the corresponding resistor are infinite. 

There are two possible cases: (i) The resistor should have a finite value in 
series with a zero-voltage source, but since a unit voltage source is assumed, 
the resistance becomes infinite; (ii) The resistor is actually open-circuited 
and its value is infinite. It is difficult to distinguish these two cases in a 
general discussion. 

Summarizing the above results, the following conclusion will be reached: \ 

An «-port generator is equivalent to n resistors each with an independent ? 

voltage source, followed by a lossless 2/7-port transforming network as | 
illustrated in Fig. 7.4. When <ee + > is positive-semidefinite, the number of 
the resistors may be less than n and/or some of the voltage sources disappear. 
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7.2 Noise Measure 

An amplifier can be considered as a two-port network which has specific 
generator and load impedances assigned to it. In addition the network 
generally contains noise sources. From the conclusion reached in the previous 
section, such a two-port network is equivalent to the canonical form followed 
by a lossless four-port network. The canonical form may have two resistors, 
one resistor, or none at all. With a passive network, the output power can 
never be larger than the input power, hence at least one resistor in the 
canonical form of an amplifier must be negative. This enables us to concen- 
trate on the following three cases: The canonical form has (i) two negative 
resistances, (ii) one negative resistance and one positive resistance, (iii) only 
one negative resistance. 

Let us first study case (i) for which the equivalent amplifier circuit is 
illustrated in Fig. 7.5. Ports I and II of the lossless four-port network 



CANONICAL LOSSLESS 

FORM CIRCUIT GENERATOR 




represent the input and output ports of the amplifier, respectively. Ports III 
and IV are connected to the negative resistances Z 3 = — R 3 and Z 4 = — R 4 
in the canonical form. Let incident and reflected waves a and b be defined 
such that their ith components are given by 

<*i = i |ReZJ -I/2 (Vj + Zjlj). b t = i ] Re Z,[ 1,2 (V, Z,*!,) (7.27) 

where the subscript refers to the port number. The scattering matrix S of 
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the lossless four-port network times a then becomes b, 

b = Sa 

From (5.87), S satisfies the lossless condition 



where P is given by 



S + PS = P 



P = diag[l 1 -1 -1] 



(7.28) 



(7.29) 

(7.30) 



since negative resistances are connected to ports III and IV. Multiplying 
(7.29) by (PS) -1 = S -1 P -1 from the right and by SP -1 from the left, 
we have 

SP -1 S + =P -1 (7.31) 

Equations (7.29) and (7.31) are equivalent to each other, and either one can 
be used as the lossless condition for S. The 22 component of (7.31) gives 



|S 21 | 2 + |5 22 | 2 -|S 23 | 2 -|5 24 | 2 = 1 



(7.32) 



which is the condition the S 2 i s must satisfy. 

With this much preparation, we are in a position to discuss the noise 
performance of the amplifier in detail. The transducer gain of an amplifier 
is defined by 

actual signal power into load 



available signal power from generator 



(7.33) 



For an amplifier shown in Fig. 7.5, the numerator is given by the signal 
component of \b 2 \ 2 . If |aj 2 represents the available signal power of the 
generator, \b 2 \ 2 = IS^il 2 j^il 2 . Thus, we have 



g = (M 2 /I « ] | 2 = |s 21 | 



(7.34) 



The exchangeable power is not used in the denominator of (7.33) because 
when the real part of the generator impedance is negative, the actual signal 
power flowing into a load impedance with a positive real part can be made 
as large as one wishes by inserting a lossless nonamplifying network. 

The operating noise temperature T op of an amplifier is defined by 



actual noise power into load 



(7.35) 



where k is the Boltzmann constant, and the noise power is measured in a 
narrow bandwidth B with a generator having a noise temperature T { . 
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From the definition of noise temperature, the available noise power of the 
generator is given by kT t B. When B = 1 MHz and T i = 290°K, kT t B is 
4xlO -15 W, or equivalently, —114 dBm. In general, the available noise 
power from an ordinary resistor at T a K is given by kTB as derived by a 
statistical consideration and confirmed by experiments, which we shall not 
discuss any further. 

The noise originating in and reflected back to the load is just as trouble- 
some as other noise components, and hence it is included in the operating 
noise temperature of (7.35). This can be seen from the definition of actual 
power given in Section 1.4. When the load impedance has a positive real part, 
the actual noise power flowing into the load is defined to be the net noise 
power transferred to the load plus the available noise power of the load, all 
in the bandwidth B. Thus, the operating noise temperature T op is a function 
of the noise temperature T L of the load, and T L may be assumed to be 290 °K 
if not stated otherwise. 

For the amplifier shown in Fig. 7.5, the ratio of the operating noise 
temperature T op to the generator noise temperature T t is given by 

fop _ I b 2 \ 2 _ l^2li \ a l \ 2 + 1-^22 1 2 \ a l \ 2 + 1 ^ 23 1 2 1^3 1 2 + IS24I 2 InJ 2 

T i - Mil 2 G “ |S 21 i 2 Mil 2 

(7.36) 



where all the power waves are comprised of noise components only and 

\atf~kTiB. 

Using G and the noise temperature ratio T op /T h we define the operating 
noise measure of an amplifier as 



(TJT f ) - 1 
1-(1/G) 



(7.37) 



If the transducer gain is large, M is essentially equal to (T op - T^fT { which 
is the ratio of the noise introduced in the amplification process to the 
amplified noise from the generator, ( kT op BG - kTfBG^kTiBG. A negative 
value of M means that G is smaller than unity. When G is larger than unity, 
as it should be for an amplifier, M is always positive; the amplifier will 
obviously become less noisy as M becomes smaller. When G decreases and 
(TJT t ) — 1 does not become smaller in value, then M cannot remain 
unchanged. When the gain is small, the added noise must be correspondingly 
small for the amplifier to be considered equally noisy. 

For a comparison of different amplifiers we use M x 7], rather than M 
itself, for the following reason: If T t is inceased for one amplifier, keeping 
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everything else the same, [S 21 J 2 | aj 2 increases in (7.36) and the output of 
the amplifier becomes noisier. At the same time M decreases giving the 
misleading impression that the amplifier is now quieter. On the other hand, 
M x Tj is independent of T h indicating the noise introduced in the ampli- 
fication process divided by kB(G - 1). 

Those who are familiar with the IRE (Institute of Radio Engineers) 
definition of noise figure F will notice that for a high gain amplifier with 
T; — 290 °K and a negligible load noise contribution, M is essentially equal 
to the excess noise figure (F — 1) of the amplifier. 

The noise figure F, at a specified input frequency, is defined as: the ratio of 
( 1 ) the total noise power per unit bandwidth at a corresponding output 
frequency “available” at the output port when the noise temperature of the 
input termination is standard (290 °K) to (2) that portion of (1) engendered at 
the input frequency by the input termination. The contribution from the 
noise originating in and reflected back to the load is not included in this 
definition. Realizing its importance, however, the IRE subsequently defined 
the operating noise temperature so as to include the noise contribution 
from the load. 

In our case, M can be written in the form 

w _ l^l 2 ^2 1 2 + 1^231 2 1^3 1 2 + I^l 2 1^4-1 2 n -q. 

(is 2 .l 2 -i)l«.l 2 ' 



where (7.34) and (7.36) are used. Rewriting the denominator with the help 
of (7.32), (7.38) becomes 

_ is 22 | 2 1« 2 | 2 + |S 23 1 2 l« 3 | 2 + |S 24 | 2 |« 4 | 2 
(-|S 22 | 2 -HS 23 | 2 + |S 24 | 2 )|n 1 | 2 

The expression inside the parentheses in the denominator cannot becomes 
less than -1 since JS 21 | 2 ^0 in (7.32); |S 22 | 2 , |5’ 23 | 2 , and | i S T 24 | 2 otherwise 
can take any positive values. 

Without a loss of generality, assume that |a 3 | 2 < |o 4 | 2 and under this 
assumption, let us investigate the possible range of the value of M. When 
|5 ’ 22 | 2 > IS 23 I 2 + 1 5” 24 1 2 > M is negative. In addition, the condition 

|S 23 | 2 = |S 24 | 2 = 0 



gives the minimum value for |M|, because when |S , 23 | 2 or ]S 24 | 2 departs 
from zero, the denominator on the right-hand side of (7.39) becomes smaller 
in magnitude and the numerator larger. Thus, M cannot come closer to 
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zero than ~\a 2 \ 2 l\a i \ 2 in the negative range. Since |a 2 | 2 is given by kT L B 
and |flj | 2 by kT^B, the maximum value in the negative range is 

= ~ ki 2 | 2 /K| 2 = - (TJTi) (7.40) 

When |S' 22 | 2 < \S 2i \ 2 + |S’ 24 | 2 , M is positive. If |S 23 | 2 and [*S’ 24 | 2 are kept 
constant, the condition |S’ 22 | 2 =0 minimizes M since the denominator in 
(7.39) decreases and the numerator increases if |S' 22 | 2 departs from zero. 
Therefore, in order to obtain the smallest value of M in the positive range, 
we have only to consider the case in which |S' 22 | 2 = 0. Since |a 3 | 2 < |a 4 | 2 by 
hypothesis, we have 

|S 23 I 2 I« 3| 2 + |S 24 | 2 M 2 ^ |S 23 | 2 |a 3 | 2 + |S 24 | 2 |a 3 | 2 |a 3 | 2 

(|S 23 I 2 + |S 24 | 2 ) Kl 2 " (|S 23 | 2 + |S 24 | 2 )K1 2 _ | fll | 2 

j II follows that when |S' 22 | 2 = |*S’ 24 | 2 = 0, M attains the smallest positive 

value M opt which is called the optimum noise measure and is given by 

r ^opt = l“ 3 | 2 /|di| 2 = - (P'.mJkTfi) (7.41) 

where P e , min indicates the exchangeable noise power with the smaller 
magnitude obtainable from the two negative resistances in the equivalent 
circuit. Later we shall discuss a case in which many negative resistances are 
involved. 

; The magnitude of M can increase indefinitely in the negative as well as 

* in the positive range if the scattering coefficients are properly selected. The 

solid lines in Fig. 7.6, therefore, give the possible range of M. In order to 
obtain M opt , the conditions [S 22 | 2 = 0 and |5' 24 | 2 =0 must be satisfied, 




Fig. 7.6. The possible range of M. 
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unless T l = 0 and |fl 3 | 2 = |a 4 | 2 , respectively. In other words, the output 
port must satisfy the matching condition and the negative resistance with 
the smaller absolute value in the canonical form must be effectively discon- 
nected from the output port. 

Given an amplifier, one method to achieve M opt is shown in Fig. 7.7. 




i i 



Fig. 7.7. Realization of M op t- 

Here the amplifier is represented by an equivalent circuit comprising the 
canonical form C followed by a lossless transformation T~ l . The lossless 
four-port network with the transfer matrix T connected to the amplifier 
restores the canonical form and the negative resistance with exchangeable 
noise power min — — |<ar 3 1 2 in the canonical form is connected to the 
generator and load through a three-port circulator. The other resistance is 
left open. Both conditions |5’ 24 | 2 =0 and |5' 22 | 2 =0 are satisfied by this 
arrangement, and hence M opt is realized. It may be worth mentioning that 
the input port is also matched in this realization of Af opl . 

We have so far discussed case (i) in which two resistances in the canonical 
form are negative. The discussions for the other two cases are similar. When 
one resistance is negative and the other is positive in the canonical form, we 
assume that the negative one is connected to port III and the positive one 
to port IV of the lossless four-port network shown in Fig. 7.5. Then, 
corresponding to (7.39), we have 

is 22 | 2 M 2 + is 23 l 2 |n 3 | 2 + |s 2 4l 2 [a 4 l 2 

“( 1 |S 22 | 2 + |S 23 | 2 -|S 24 | 2 )|a 1 | 2 

When there is only one negative resistance in the canonical form, we assume 
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that the negative resistance is connected to port III, and that port IV is 
eliminated from the beginning. Then, we have 

|S 22 | 2 |a 2 | 2 + |S 23 | 2 |n 3 | 2 
(- |S 22 | 2 + |S 23 | 2 )K| 2 

For both cases, the optimum noise measure is given by 
M„p, = | 03 l 7 l«,| 2 = ~{P., m JkT,B) 

where P e>min is equal to the exchangeable noise power of the negative 
resistance in each case. 

Let us now ask the following question: What is the best (i.e., smallest 
positive) noise measure obtainable when we are given several amplifiers 




Fig. 7.8. Equivalent circuit of the composite amplifier. 
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and are allowed to use any passive networks in order to connect them? A 
passive network can be considered to be a lossless multiport network with 
some of the ports connected to positive resistances. The most general 
equivalent circuit of the amplifier we can construct is therefore given by 
Fig. 7.8 where the lossless parts of the component amplifiers are all included 
in the lossless network. Once this equivalent circuit is obtained, we can 
easily apply an argument similar to the one previously used, and the optimum 
noise measure is given by 

M opt = -(P e , m Jkm 

where P emin is the exchangeable noise power with the smallest magnitude 
among all the negative resistances in the equivalent circuit. This result 
shows thatthere is no possibility of improving the noise measure by combining 
amplifiers. The best value we can obtain from the combined amplifier is 
equal to the best value obtainable from the single amplifier having the 
smallest M op( . 

On the other hand, M nl is given by 

M m = - {T m JT) 

where T min is the lowest noise temperature of all the positive resistances in 
the equivalent circuit. The range of possible M is from M opt through infinity 
to M m , which is similar to Fig. 7.6. 

Suppose that two amplifiers are connected in cascade through a circulator 
as shown in Fig. 7.9. The third port of the circulator is terminated by the 
matched resistance R 0 having a noise temperature T l i- The reference 



impedances of the circulator used to define the scattering matrix are S 




CIRCULATOR t 

J* 

Fig. 7.9. Cascade connection of two amplifiers. | 



# 

;f 
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so that both amplifiers see the correct load and generator impedances assigned 
to them. Since the actual power flowing into port I of the circulator is 
transformed to the available power from port II, the overall gain G becomes 
the product of the individual amplifier gains: 

actual signal power into load 

G = — . 

available signal power from generator 

actual signal power into circulator 

available signal power from generator 

actual signal power into load 

x — 

available signal power from circulator 

= C,C 2 (7.43) 

where Gy and G 2 represent the transducer gains of the first and second 
amplifiers, respectively. 

The actual noise power N flowing into the load is given by the sum of the 
amplified noise and the noise added by the second amplifier. The added 
noise is equal to M 2 kT i B(G 2 — 1), where M 2 is the noise measure of the 
second amplifier with generator noise temperature T). The available noise 
power at the input of the second amplifier is equal to the actual noise power 
flowing into port I of the circulator in Fig. 7.9, namely, M l kT i B(G 1 - 1) + 
+ kT i BG l , where A/ x is the noise measure of the first amplifier with generator 
noise temperature T t . Thus, we have 

N = kT i BG i G 2 + MykTfiiGy - 1) G 2 + M 2 kT i B{G 2 - 1) 

from which the overall operating noise temperature T op can be obtained as 
follows: 

N ( 1 G 2 - 1\ 

= kBG “ r < Gi + ~ij-) (M4> 

where use is made of (7.43). Substituting (7.43) and (7.44) into (7.37), the 
overall noise measure becomes 



( 1 G 2 - 1\ 

= m 1 + (m 2 -m 1 ) 

GjG 2 - 1 




(7-45) 



Suppose that the two amplifiers are identical, then M x =M 2 and the second 
term on the right-hand side of (7.45) disappears which makes M =M { and 
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7.3. Unconditional Stability 
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G — G t 2 . If we connect another amplifier to this cascaded amplifier through 
a second circulator, we obtain an amplifier with M — M x and G = G l 3 . 
Continuing this process, if we connect a large number of identical amplifiers 
in cascade, we obtain a high gain amplifier having the same noise measure 
as that of the individual amplifiers. From this observation, we can interpret 
the noise measure of an individual amplifier as the excess noise figure 
(F^ — 1) of a high gain amplifier constructed by cascading a large number 
of such amplifiers. When the amplifier under consideration is matched at 
both ends and Z g = Z L *, the circulators can be eliminated from the above 
discussion. 

One might think that the circulator with a resistor R 0 at the third port 
can be replaced by a lossless network in Fig. 7.9. This is generally not true, 
however. First of all, the noise temperature of the input impedance of the 
second amplifier may not be the same as that of the load impedance of the 
first amplifier. Furthermore, if the first amplifier has an output impedance 
with a negative real part, there is no way of transforming this to have a 
positive real part by any lossless transformation, and hence the second 
amplifier is unable to see a proper generator impedance. The circulator 
effectively isolates the two amplifiers which would otherwise interfere with 
each other as described. 

The discussion in this section can be summarized as follows: 



(i) The smaller the positive value of M is, the less noisy the amplifier. 

(ii) Negative noise measure means that the gain of the amplifier is less 
than one, i.e., attenuation occurs rather than amplification. 

(iii) M opl can be realized by a lossless transformation. 

(iv) It is impossible to achieve positive M less than M opl by any passive 
transformation. 

(v) TheM op( of a combined amplifier is equal to the best M opi among the 
component amplifiers. 



7.3 Unconditional Stability 

An amplifier is said to be unconditionally stable if the real parts of its 
input and output impedances remain positive when the load and generator 
impedances are changed arbitrarily provided their real parts remain positive. 
Let us consider the condition for the unconditional stability. Let S lt , S l2 , 
S 21 , and S 22 represent the scattering coefficients of the amplifier where 
subscripts 1 and 2 refer to the input and output ports, respectively. We assume 




that the real parts of the reference impedances Z x and Z 2 are both positive 
and that IS^il <1 and | ^ 22 ! <1; otherwise the real parts of the input and 
output impedances cannot be assumed to remain positive. Suppose that the 
load impedance Z L is connected to the output port, then a 2 is found to be 

a 2 = {(^l — Z 2 )/(Z L + Z 2 *)} b 2 = r 2 b 2 

in a similar manner to the derivation of (5.132) where r 2 is the reflection 
coefficient of Z L with respect to Z 2 *. Since we keep the real part of Z L 
positive, |r 2 j < 1. The equation b = Sa becomes 

b 1 = S (1 a! + S 12 r 2 b 2 , b 2 = S 2l a x + S 22 r 2 b 2 

from which we obtain 

frj = (Sn + ^I2 r 2 | 5'2l(l — r 2^22) } a i (7.46) 

It follows from' this that the condition for the real part of the input impedance 
to remain positive is given by 

|Sn + S 12 r 2 S 21 (l — r 2 S 22 ) I < 1 

which is equivalent to 

| - AS 2 ^ + B( 1 - r 2 S 22 )~ 1 1 < 1 (7.47) 

where 

A = S I2 S 21 — Sj X S 22 , B = S l2 S 2I S 22 (7.48) 

The problem has now been reduced to finding the condition for (7.47) to 
hold regardless of the value of r 2 when |r 2 | <1. 

To find the above condition, let us next investigate the bilinear trans- 
formation 

z = - AS 22 + B (1 - r 2 S 22 )~ 1 (7.49) 

and find the image of the unit circle |r 2 l = 1 on the z plane. The image of the 
unit circle on the —r 2 S 22 plane is a circle with its center at the origin and 
radius |iS 22 |- The image on the (1 — r 2 S 22 ) plane is obtained by shifting the 
previous image to the right by a unit distance. The image on the (1 — r 2 S 22 ) 1 
plane is a circle with its center on the real axis and with intersection points 
on the real axis at (1 + |S 22 1) -1 and (1 — |S 22 |)“ ! . On the fi(l — r 2 S 22 ) 1 
plane, it becomes a circle with the center at 

1 ( B B \_ B 

2 \Ths 2 7i + 1 + is 22 ij ~ 1 - is7 
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and radius 

1/1*1 _ 1*1 \b\\s 22 \ 

2\i - |S„i 1 + |S 22 |7 1-|S 22 | 2 

Note that the inside of the unit circle |r 2 | =1 corresponds to the inside of 
this circle since [S 22 \ <1. The complex vector representation of this circle 
is given by 

c _ B , ' s ^< ^ 

1-|S 22 | 2 + 1-|S 22 | 2 

where 0 varies from 0 to 2n and q represents the vector drawn from the origin 
to the moving point on the circle. From this, we see that the unit circle 
|r 2 | =1 is transformed by (7.49) into a circle represented by 

A B \B\\S 22 \ 

z — — — -t- — -f- , e 

*22 1 - l*22| 2 1 - l*22| 2 



AS* 22 + *11 _|*| 1*22 

1-|S 22 | 2 1 1-|S 22 



(7.50) 



If the real part of the input impedance is to remain positive, the magnitude 
of z has to be smaller than unity regardless of the value of 9, i.e., 

M*22 + *,.1 + 1 *1 l*22l 
” 1 I *22 1 2 

The above inequality is equivalent to 



Let us assume that 



c , *22*12*21 t 1*12*2 1 1 

Sll + i-l*22l a ' 



l*12*2ll< 1-1*221 



(7.51) 



(7.52) 



otherwise (7.51) does not hold. Squaring both sides of (7.51), we obtain 
after a little manipulation 

2 R es n s 22 sr 2 si, < 1 - |S, a 2 - |S 22 | 2 + |S„S„| 2 

+ l* 12 * 21 | 2 — 2|S ]2 S 21 j 

which reduces to 

l*i 1 1 2 + 1*22 1 2 + 2|S 12 S 2 ,| < 1 4- fS 12 5 2 ] — S,jS 22 | (7.53) 

Thus, we see that the real part of the input impedance remains positive if 
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(7.52) and (7.53) are satisfied. The corresponding conditions for the real 
part of the output impedance to remain positive are obtained by inter- 
changing the subscripts 1 and 2. However, since (7.53) remains unchanged 
during this interchange, the conditions for the amplifier to be unconditionally 
stable are given by 

|S 12 S 2 ,|<l-[S n , 2 

|S 12 S 2J | < 1 — |S 22 [ 2 (7.54) 

|s,,| 2 + |S 22 | 2 + 2|S 12 S 2 ,| < 1 + I S I2 S 2I - S U S 22 | 2 

Note that (7.54) is a necessary and sufficient condition for unconditional 
stability. 

If an amplifier is unconditionally stable, it can be simultaneously matched 
to generator and load impedances with positive real parts. For the proof of 
this statement, we proceed as follows. When the load impedance Z L is 
connected to the output port, from (7.46) the reflection coefficient at the 
input port is given by 

(Z, - Z * )/(Z, + Z,) = S„ + S 12 r 2 S Z i (1 - r 2 S 22 )~ 1 

where Z, is the corresponding input impedance. Let be the reflection 
coefficient of Z g with respect to Z t *, i.e., 

r, = (Zy — Z t )l(Z g + Z,*) (7.55) 

and suppose that r x * is equal to (Z, - Z x *)((Z, + Z, ), then Z g becomes the 
complex conjugate of Z, indicating that the input port is matched. Thus, 

f, — S u + S l2 t‘ 2 S 2 1 (1 — r 2 S 22 ) 1 (7.56) 

gives the matching condition at the input port when the load impedance is 
Z L . Similarly, when the generator impedance Z y is connected to the input 
port, the matching condition at the output port is given by 

r 2 ~ *22 + *2 1 r 1*12 (1 — r l*l l) 1 (7.57) 

For simultaneous matchings, these two equations must both be satisfied. 
To simplify the following discussion, let us now assume that either one of 
*i, or * 2 2 * say S 22 , is equal to zero. This can be done without loss of 
generality when we discuss unconditionally stable amplifiers since, for a given 
Z t with a positive real part, Z 2 with a positive real part can always be chosen 
to satisfy the matching condition 5 22 = 0. Under this assumption, elimi- 
nating r 2 from (7.56) and (7.57), we have 

^ 2 *u + ^i(l* 12 * 2 i ! 2 - l*i 1 1 2 - i) + s* n = o 



(7.58) 




312 7. LINEAR AMPLIFIERS 

The solutions of this equation are given by 

r = 1 _+ iSnl 2 --|S 12 S 2 |l 2 ± {(1 + | 5 11 | 2 -| 5 12 s 21 | 2 ) 2 - 4 | 5 n | 2 } 1/2 

(7.59) 

where S n #0 is assumed; otherwise the input is matched to Z lt and no 
discussion is necessary. The expression inside the square root is positive 
since (7.54) reduces to 1 — [S^l > |S , 12 5' 21 | which is equivalent to 

l + |S n | 2 -|S 12 S J1 | 2 >2|S 11 | 

The product of two solutions of (7.58) is equal to S*JS n . Hence, one of 
the solutions should have a magnitude smaller than one. With this solution, 
the simultaneous matchings can be achieved keeping the real part of the 
generator impedance positive. Since the output impedance has a positive 
real part under this condition, the matched load impedance must also have 
a positive real part, which completes the proof. 

From the above discussion concerning unconditionally stable amplifiers, 
we see that Sj, and S 22 can be set equal to zero from the beginning by 
choosing proper values for Z, and Z 2 , each with a positive real part. This 
greatly simplifies the discussion, for example, if S n = <S 22 = 0, (7.54) 
reduces to 

|S, 2 S 2 il < i ( 7 - 6 °) 

For a given unconditionally stable amplifier and a fixed generator impe- 
dance, let us consider the effect of varying the load impedance on the trans- 
ducer gain. The maximum transducer gain is obtained when the output 
port is matched since the generator and the amplifier together can be 
considered as a new generator with an internal impedance having a positive 
real part. Next, let us consider the case in which the generator impedance is 
varied, keeping the available power of the generator constant. For a given 
load impedance, the output power, and hence the transducer gain is 
maximized when the input port is matched since the load current is propor- 
tional to the input current which is maximized when the input port is 
matched. Now suppose that we simultaneously change the generator and 
load impedances keeping the output port matched. Suppose also that under 
this condition, the generator impedance Z y gives the highest transducer 
gain and let Z 2 be the corresponding load impedance, then the output port 
is guaranteed to be matched. If the input port is not matched, then the same 
load impedance Z 2 has a corresponding generator impedance Z y which 
gives the matching condition at the input port and hence a higher transducer 
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gain. With Z y , the output port may not be matched. However, there is a 
load impedance Z 2 ' which satisfies the matching condition at the output 
port, and hence gives a still higher transducer gain. This contradicts the 
assumption that Z y gives the highest transducer gain under the matched 
condition at the output port. We, therefore, conclude that when the generator 
and load impedances of an unconditionally stable amplifier are allowed to 
change arbitrarily, provided their real parts are positive, the maximum 
transducer gain is obtained when both ports are simultaneously matched. 

Let us consider the canonical form of an unconditionally stable amplifier. 
If the amplifier does not contain a positive resistance, the real part of the 
impedance looking into one port of the amplifier must become negative 
when the termination at the other port becomes almost purely reactive. 
Consequently, there must be a positive and a negative resistance in the 
canonical form of an unconditionally stable amplifier to ensure the impedance 
has a positive real part. The equivalent circuit is shown in Fig. 7.10. Let Z, 
and Z 2 be the reference impedances at the input and output ports used to 
define the scattering matrix such that ReZ x > 0, ReZ 2 > 0, S n = 0, S 22 = 0. 
Note that we can assume \S 2i \ >1, since \S 2l \ 2 is the maximum transducer 
gain of the amplifier. 




Fig. 7.10. Equivalent circuit of an unconditionally stable amplifier. 



Now assume that port IV in Fig. 7.10 can be effectively disconnected 
from port II by choosing a proper generator impedance, then M op[ is 
realized provided an appropriate load impedance is chosen to satisfy the 
matching condition at the output port. The existance of such a load imped- 
ance with a positive real part is guaranteed by the unconditional stability. 
Let us now investigate whether or not ports II and IV can be effectively 
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disconnected as assumed above by connecting some Z g with a positive real 
part at port I. When Z g (with no voltage source in series) is connected to 
port I, 

<*i=r 1 b l (7.61) 

where is the reflection coefficient of Z g with respect to Z t * as shown in 
(7.55). Since the voltage sources in Fig. 7.10 are independent of each other 
and the circuit is linear, the principle of superposition holds, and we can 
assume that a 2 = a 3 = 0 when investigating the effect of a 4 on the output 
port. Remembering that 5' 11 =0, the first two scattering equations then 
become 

^1 ~ ^14^4? ^2 = ^21 a l + *^24^4 

Substituting (7.61) in the second equation and solving for b 2 , we obtain 
b 2 = ( 5 2 iOS 14 + S 24 ) n 4 
Therefore, if we choose Z g such that 

r \ — ~ (^ 24 /^ 2 1 ^ 14 ) (7.62) 

then ports II and IV are effectively disconnected. In order to ensure 
ReZ ff > 0, we have to show that \r t \ < 1. From the lossless condition (7.31), 
the 11, 22, and 12 components give 

|S 12 | 2 -I + |S 14 | 2 = |S 13 | 2 
|S 2 i| 2 -1 + |S 24 | 2 = |S 23 | 2 
|Si 3 S 23 | = |S 14 S 24 j 

respectively. Multiplying the left-hand sides of the first and second equations 
and equating the result to |S , 14 S 24 | 2 , which is equal to IS^S^I 2 from the 
equation, we obtain 

(|S 21 | 2 - 1)(|S 12 | 2 - 1) + |S 24 | 2 (|S 12 | 2 - 1) + |S 14 | 2 (|S 21 | 2 - 1) = 0 
which is equivalent to 

(* - |S l2 | 2 ) {1 + |S 24 | 2 (|S 2 ,| 2 - 1)-} = |S, 4 | 2 (7.63) 

Since iS 2 J 2 > 1, and from (7.60) 

is 12 i 2 <is 21 r 2 

the substitution of |S 12 | 2 by IS^J 2 in (7.63) gives 

a - \s 2 i r 2 ) {i + is 24 i 2 (is 21 i 2 - 1)- 1 } < is J 4 i 2 
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which reduces to 

|S 2 i| 2 - 1 + |S 24 | 2 < |S, 4 S 21 | 2 
However, since |S 2 ,| 2 >1, this shows that 

|S 24 | 2 <|S 14 S 2 ,| 2 

which guarantees |r, | < 1 from (7.62). 

From the above discussion, we conclude that M opl can be realized by 
choosing proper generator and load impedances when the amplifier is 
unconditionally stable. In other words, neither feedback nor a circulator 
is necessary to obtain the optimum noise measure. If the generator and load 
impedances are specified beforehand, we can insert lossless two-port net- 
works at the input and output ports to transform these impedances to 
appropriate values. The unconditional stability at one frequency does not 
guarantee the same stability at other frequencies. If adjustable circuits are 
inserted at the input and output in order to obtain M opt , the system may 
oscillate at some other frequency. The circuits which transform the generator, 
and load impedances for M opt must, therefore, be designed carefully in order 
to maintain negligibly small insertion loss at the desired frequency while at 
the same time preventing possible oscillations at other frequencies. 



7.4 Unilateral Gain 

In Section 7.1, it was shown that the values of the resistances in the 
canonical form are invariant to nonsingular lossless transformations. These 
invariants are unique in the sense that they and their functions are the only 
invariants. However, if the transformations are assumed to be reciprocal as 
well as lossless, we can expect still more invariants. In fact, there is at least 
one more invariant called the unilateral gain which is defined by 



fdet (Z — Z f )| 
det(Z + Z*) 



(7.64) 



In this section, we shall prove the invariance and then rewrite the formula 
in terms of the scattering matrix in order to see the origin of its name. 

Suppose that an «-port network having an impedance matrix Z is trans- 
formed into another u-port network by a nonsingular, lossless, reciprocal 
2/j-port network. The new impedance matrix Z' is given by (7.13). The 
lossless condition for the 2«-port network is expressed by (7.15). The 
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reciprocal condition gives 

Z aa = Z 00l , Z ab — Z ba , , Z bbl = Z bb (7.65) 

Combining (7.15) and (7.65), we see that the component matrices are 
expressible in terms of real matrices X aa , X ab , X bb as follows: 

7'aa = JX aa , Z afc = jX ab , Z bb = jX bb 

where X aa and X bb are both symmetrical with respect to the main diagonal, 
and X ab = (X ba ) ( . In terms of these real matrices, (7.13) becomes 

z = X ba (Z + j’X aa ) 1 X ab + jX bb (7.66) 

An inspection of (7.66) shows that the impedance transformation can be 
decomposed into three different types of transformation: 

(i) Z' = Z+jX, where X, = X; 

(ii) Z' = XZX„ where X represents an arbitrary real matrix; 

(iii) Z = Z l . 

If a certain quantity is invariant to these three transformations, it is also 
invariant to the transformation given by (7.66). For the proof of the in- 
variance of U, therefore, we have only to show the invariance during these 
transformations. For transformation (i), we have 

z' - z; = z +jx - (z, +jx t ) « z - z, 

Z + Z'* = Z + jX + (Z* - jX) = z + z* 

and hence U is invariant. For transformation (ii), we have 

z - z; = xzx t - (xzx t ), = x (z - z,) x, , 

Z + Z'* = XZX, + (XZX,)* = X(Z + Z*) x t 

Using these relations, U' is calculated to be 

= |det(Z- - Z/)| = IdetXj 2 |det(Z - Z,)| 

“ det (Z' + Z'*) (det X) 2 det (Z + Z*) 

and hence U is again invariant. Finally, for transformation (iii) we have 

z'-z/ = z- , (z-z f )(-z f )- 1 

Z' + Z'* = Z _1 (Z + Z*) (Z *)~ 1 
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and hence 

_ | det (Z' - Z/)| _ [det Z _ 1 1 2 |det(Z - Z f )| = 
det(Z' + Z'*) |detZ~ 1 | 2 det(Z + Z*) 

This completes the proof that U is invariant to nonsingular, lossless, recip- 
rocal transformations. 

Let us next consider how to express U in terms of the scattering matrix S. 
Using (5.79), Z — Z t can be rewritten in terms of S as follows: 

Z - Z t = F" 1 (I - S)“ 1 {(SG + G + ) F 2 (I - S p ) 

- (I - S) F 2 (GS, + G + )} (I - S,)" 1 F" 1 
= F - 1 (I — S)~ 1 (SF 2 (G + G + )-F 2 (G + G + )Sj (I-S,) -1 F" 1 
= ^F -1 (I — S) -1 (SP-PS,) (I-S,)' 1 F _1 (7.67) 

Similarly, Z + Z* becomes 

Z + Z* = F~ 1 {(I — S )~ 1 (SG - G + G + G + ) 

+ (I - S*) _1 (S*G + - G + + G + + G)} F 
= F“ 1 { — I + (I — S)~ 1 + (I - S*)" 1 } (G + G + ) F 
= P’~ 1 (I-S)~ 1 { — (I — S) (I-S*) 

+ (I - S*) + (I - S)} (I - S*) -1 PF -1 
= jF _1 (I — S) -1 (I-SS*)(I-S*) _1 PF -1 (7.68) 

Substituting (7.67) and (7.68) into (7.64), we obtain the desired expression. 



|det(SP - PS,) | 
det P det (I — SS*) 



(7.69) 



Suppose that a two-port network with unilateral gain U is imbedded in a 
nonsingular, lossless, reciprocal circuit and that S l2 of the resultant circuit 
becomes zero; i.e., the circuit is unilateralized. The U of the unilateralized 
circuit should remain the same as before because of the invariant property. 
With 5” 12 = 0, the generator and load impedances do not affect the output 
and input impedances, respectively, thereby permitting the input and output 
ports to be matched independently of each other. This means that it is 
always possible to assume that S n = 0 and .S 22 = 0 for the unilateralized 
circuit by using proper reference impedances. Equation (7.69) then becomes 



u^PiPz is 2 ii 2 



When = p 2 =1, U represents the maximum transducer gain of the uni- 
lateral circuit which explains why U is called the unilateral gain. 
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A method for unilateralizing a two-port network is illustrated in Fig. 7.1 1. 
The box between ports I and II represents the given two-port network to be 
unilateralized. First, a reactance jX is connected in series with port II to 
bring V 2 into phase with V t when J v = 0. Next the ideal transformer is 
adjusted to cancel V 1 so that V/ becomes zero. Z' l2 of the transformed 



I : n 




Fig. 7.11. Unilateralization of a two-port network. 



circuit then becomes zero by definition, and the corresponding S l2 vanishes , i 

from (5.78), i.e., the circuit is unilateralized. f 



7.5 Tunnel Diode Amplifiers 

If we measure the terminal characteristic of a tunnel diode, we shall find 
a certain range of bias voltage in which the current decreases as we increase 
the voltage as shown in Fig. 7.12. Let us fix the bias voltage V 0 somewhere 





in this range and superpose a small microwave signal. Then, when the alter- 
nating voltage swings in the positive direction, the corresponding alternating 
current swings in the negative direction and the device exhibits a negative 
resistance characteristic. The magnitude of the negative resistance is given 
by the inverse of the slope of the I-V characteristic at the bias point; i.e., 
cotfl = - R. Once a negative resistance becomes available, an amplifier can 
be built as we discussed at the end of Section 5.8. Before we discuss the 
amplifier, let us study the origin of the negative resistance characteristic and 
then construct a simple equivalent circuit for a tunnel diode. To do so, we 
must first give an elementary discussion of semiconductors. 

As is well known, light is one form of electromagnetic energy that prop- 
agates as a wave. However, when it strikes a photoelectric surface, it behaves 
as though it were a particle. To express this corpuscular nature of light, 
the name photon is ordinarily used. The photon is considered to be asso- 
ciated with the light wave. The probability that a photon is found at a 
certain place is proportional to the square of the wave amplitude at that 
point. In much the same way, an electron is also associated with a wave. 
Inside an atom, a standing wave is formed for each electron around the 
nucleus, and the probability of finding an electron at a certain point is 
proportional to the square of the amplitude of the corresponding standing 
wave at that point. Only certain functions are acceptable for representing 
such standing waves since each has to be a single-valued continuous function 
of position while its integral over the whole space must be finite. The electron 
associated with a certain standing wave has a certain energy determined by 
the corresponding wave function. Therefore, in a steady state, electrons 
inside an atom can have only discrete energies. This situation is similar to 
that of a resonant cavity in which only certain field patterns are allowed to 
exist, and each has its own resonant frequency. 

If we consider the electron as a particle, it has a certain potential energy 
due to the attractive force from the positive charge of the nucleus, and 
kinetic energy due to its motion inside the atom. The graph of potential 
energy as a function of the distance from the nucleus should look similar to a 
morning-glory flower whose cross section is shown in Fig. 7.13. Suppose 
that the electron has total energy £, and that a horizontal line is drawn at 
E= E l . The region between the two intersections with the potential curve 
represents the space within which the electron can exist as a classical particle. 
At each intersection, all the kinetic energy is replaced by potential energy, 
and hence the particle with energy £, is not permitted to move beyond this 
point. It could be imagined that there is a fictitious “potential wall” along 
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the potential curve. However, the standing wave associated with the electron 
does not necessarily vanish beyond the potential wall since around the atom* 
there is nothing which behaves like a perfect conductor to reflect the wave 
completely. This means that there is a finite probability of finding the 
electron beyond the potential wall; therefore, the horizontal solid line 
representing the space within which the electron is allowed to exist should 
be considered to extend beyond the potential wall, tapering off quickly with 
increasing distance. 

There is a limitation to the number of electrons which can belong to each 
wave function, i.e., which are associated with the same standing wave 
pattern. No more than two electrons can belong to one wave function, and 
they must have the opposite spin orientations if they belong to one wave 
function. When two electrons belong to one wave function, an additional 
electron has to belong to another wave function, generally with different 
energy. Furthermore, the first two electrons usually belong to the wave 
function with the lowest energy, and the next two electrons to the wave 
function with the next lowest energy, and so forth, until all the electrons in 
the atom belong to certain functions with the lowest possible energy. 

So far we have discussed electrons in a single atom. Next, we discuss many 
atoms which come close together to form a crystal. As atoms approach one 
another, the electron wave functions of different atoms start to interfere 
with each other, and each energy level splits into many different levels. This 
is similar to the split of the propagation constant of two identical waveguides 
coupled together. When the crystal is formed, some of the wave functions 
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extending over large distances from the nuclei interfere with each other 
so strongly that they lose their identity. In other words, it is no longer clear 
to which atom they belong, and they spread over the entire crystal. Their 
energy levels are crowded together and occupy some form of band as shown in 
Fig. 7.14. Each horizontal line roughly shows the range where the electron 
belonging to the corresponding wave function can exist. Each wave function 
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Fig. 7.14. Energy bands in crystal. 

can accommodate no more than two electrons as before; and, under this 
condition, electrons try to belong to the wave function with the lowest 
possible energy, or equivalently, to be in the lowest possible energy state. 
Therefore, all the wave functions up to a certain level are filled with electrons, 
and wave functions above that level are left empty. However, if the atoms 
vibrate strongly due to heat, some of the electrons obtain energy from the 
vibration and jump up to the higher energy states which were originally 
empty. The probability f(E) that electrons occupy a certain energy state 
therefore appears as shown in Fig. 7.15. At absolute zero temperature, the 
probability is unity up to a certain level and zero above that level as shown 
in Fig. 7.15(a). As the temperature increases, the probability that electrons 
take high energy states increases, and the probability for low energy states 
correspondingly decreases as shown in Fig. 7.15(b). This is the Fermi- Dirac 
distribution law. The horizontal dotted line in Fig. 7.15(b) shows the Fermi 
levels where the probability becomes one half. 

Various phenomena of electric conduction can be explained using the 
energy bands described above and the Fermi- Dirac distribution law. Ordinary 
metals have an energy band which is only half filled by electrons with the 
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(a) (b) 

Fig. 7.15. Probability /(£) of electron occupancy of available states: (a) T — 0°K; 
(b) At a higher temperature. 



upper part left empty. When an electric field is applied, the electrons in this 
band move in the opposite direction associated with the wave packets made 
up of several empty wave functions just above the original states. A large 
field is not necessary to start net electron motion or conduction; hence, 
metals are good conductors. The half-filled energy band is called the con- 
duction band. 

For insulators, one energy band called the valence band is completely 
filled with electrons and the conduction band above it is empty. The energy 
gap between them is so large that the electrons in the valence band can 
hardly be excited into the conduction band. Since there is no room for 
electrons to move in the valence band and no electrons exist in the conduction 
band, electric current cannot be induced by an ordinary electric field, which 
is the basic property of insulators. 

For intrinsic semiconductors, the energy gap between the valence and 
conduction band is small, and some electrons from the valence band can be 
excited into the conduction band by heat leaving behind some room for 
electrons in the valence band to move. The vacancy each electron leaves 
behind in the valence band, when it is excited into an upper level, is called a 
hole. When electrons in the valence band move about using these vacancies, 
the holes may be considered to move in the opposite direction. Holes, 
therefore, behave like particles each with a positive electric charge e , where 
— e is the electron charge. When an electric field is applied, electrons and 
holes move in the conduction band and valence band, respectively, and 
contribute to the conduction current. However, the number of electrons 
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and holes available for this net motion is small, and the conductivity is 
small compared to metals, which is the reason for the name semiconductor. 

For impurity semiconductors, Fig. 7.14 is modified by the presence of 
impurity atoms as shown in Fig. 7.16. When the energy level of an empty 
wave function of the impurity atom is located just above the valence band as 
shown in Fig. 7.16(a), the semiconductor is said to be a /t-type. On the other 
hand, when the energy level of a filled wave function of the impurity atom 
is located just below the conduction band as shown in Fig. 7.16(b), the 
semiconductor is said to be an «-type. Electrons in the valence band in a 
p-type semiconductor can be excited by heat to jump into the empty level 
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Fig. 7.16. Energy bands and impurity levels in impurity semiconductors: (a) /Mype 
semiconductor; (b) w-type semiconductor. 
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of the impurity atoms creating holes in the valence band. Similarly, the 
electrons of the impurity atoms in an n-type semiconductor can jump into 
the conduction band ready to participate in electric conduction. In either 
case, conduction current takes place when an electric field is applied. If we 
indicate the band edges and the energy levels of the impurity atoms, neglecting 
the potential walls of atoms as well as the detailed structure in the atoms. 
Figs. 7.16(a) and (b) become Figs. 7.17(a) and (b), respectively. The Fermi 
level of each case is indicated by the dotted line. At room temperature, 
some of the impurity atoms in the p - type semiconductor receive electrons 
from the valence band and are negatively charged or ionized. On the other 
hand, some of the impurity atoms in the n-type semiconductor give up elec- 
trons to the conduction band and are positively ionized, as illustrated in 
Fig. 7.17. Although the impurity atoms are charged, the total number of 
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Fig. 7.17. Energy band structures of impurity semiconductors: (a) /Mype semi- 
conductor; (b) n-type semiconductor. 



electrons in the crystal remains the same, and hence the crystal as a whole 
is neutral. 

Now, suppose that the p-type and n-type semiconductors come into con- 
tact. If the Fermi level of the n-side is higher than that of the p- side, electrons 
move from the n-side into the p - side where more empty states are available at 
the same energy level; the p-side then becomes negatively charged and the 
n-side positively charged. Since the vertical axis of our diagram indicates the 
energy of electrons, the potential is lower towards the top. Thus, the nega- 
tively charged p - side rises as a whole and the n-side goes down, as shown 




in Fig. 7. 1 8, until the Fermi levels on both sides coincide. Under this con- 
dition, the probabilities that available states are occupied by electrons are 
the same on both sides at each energy level, and no further transfer of 
electrons takes place. Electrons near the junction in the «-region are trans- 
ferred into the p-region where they recombine with holes and disappear. 




Thus, free charge carriers near the junction are depleted, and the charged 
impurity atoms build up a strong field which provides the necessary potential 
difference to make the Fermi levels on both sides coincide. 

If we connect a battery to make the p-side negative and the n-side positive, 
the p-region rises in Fig. 7.18, the n-region goes down, and at the same time 
the depeleted region becomes thicker to accommodate the externally applied 
potential difference. Both positive and negative charges in the depleted region 
increase as the region becomes thicker, and since the charges increase as we 
increase the applied voltage, the junction behaves like a capacitance. This 
capacitance is called the depletion layer capacitance. 

If we connect the battery in the opposite direction, the p-region goes down 
and the n-region rises, in Fig. 7.18. Electrons then move from the n-region 
into the p-region where empty states are available abundantly at the same 
energy level. Since electrons move from the n- to p-sides, net current 
flows from the p- to n-sides. This is called the forward direction of the 
junction, and the forward current increases as we increase the applied 
voltage. When the current flows, excess electrons (holes) in the p-region 
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(^-region) build up a net negative (positive) charge which increases with 
increasing current, and hence with increasing voltage applied to the junction. 
This stored charge introduces another capacitance effect called the diffusion 
capacitance. The depletion layer and diffusion capacitances combined are 
called the junction capacitance. 

Suppose that the number of impurity atoms increases so that one out of 
every 1000 atoms in the crystal, or equivalently, one out of every ten atoms 
in one direction is an impurity, then the wave functions of the impurity 
atoms begin to interfere with each other, and their energy level splits thus 
forming a band structure, as before. This energy band is connected to the 
bottom of the conduction band in the case of n-type semiconductors and 
to the top of the valence band in the case of p-type semiconductors. The 
resultant conduction or valence band is now only partially occupied by 
electrons and behaves like the conduction band in metals. The conductivity 
of such semiconductors is therefore high. Furthermore, the depletion layer 
at the junction between two such semiconductors becomes very thin since, 
with the high density of ionized impurity atoms, only a thin layer is required 
to build up the potential necessary for the Fermi levels on both sides to 
coincide. Under this condition, the wave functions on the opposite sides of 
the depletion layer interfere with each other. The interference may be weak; 
nevertheless, just as we saw in the discussion of coupled modes, the amplitude 
of one wave is transferred to the other through the interference, which means 
the particle in one region is transferred to the other through the interference. 
Even when no voltage is applied to the junction, electrons are constantly 
moving from the p- to ^-region and back due to the wave interference. 
However, since the number of electrons moving in one direction is equal 
to the number moving in the opposite direction, no net current is obser- 
ved. 

Let us calculate the number of such electrons moving back and forth 
through the junction. To do so, let q{E) be the number of wave functions 
in a unit energy interval, counting twice each function for electrons with 
opposite spin orientations. Let /(£) be the probability of available wave func- 
tions being occupied by electrons at energy level E. Subscripts n and p will 
be used to express the quantities in the n- and p-regions. We first consider 
the transfer from the n - to the p-region. The number of electrons in the 
w-region in a small energy interval AE is given by q„{E) A Ef„(E), the 
number of available states times the probability of occupancy. The number 
of available empty states in the p-region is given by q p {E) AE{ 1 —f p {E)}. 
Let a be the probability that an electron on one side is transferred to an 



A 



¥ 




empty state on the other side. Then the number of electrons moving from 
the n- to ^-region in AE is given by 

a Q „(E)AEf n {E)Q p (E)AE{i -/„(£)} 

The transfer with a large energy change is unlikely since it corresponds to 
the interference between two waves with quite different propagation con- 
stants. If we neglect the effect of such a transfer, the total number of electrons 
moving from the n- to the p-region is given by 

I (“ AE) a ,(E)f,(E)e p (E ) {1 - /,(£)} AE 

E 

Similarly, the total number of electrons moving from the p- to the n-region 
is given by £ {a AE) g p{E )f p (E) e„(£) {1 - L(E)} AE 

E 

Therefore, remembering that the electron charge is negative, the net current 
from the p- to the n-side must be proportional to 

I (« AE) g „ (£) e r (E) {/„ (£) - /„ (£)} AE 

E 

When the Fermi levels on both sides coincide, /„(£) is equal to f p {E) 
for each E and the net current vanishes. If a small forward bias is applied 
and the p-side potential is raised to lower the Fermi level, f n (E) — f p (E) 
no longer vanishes in the vicinity of the Fermi levels and some net current 
results. At the edges of the energy bands q p (E ) and q„(E) generally look like 
Figs. 7.19(a) and (b), respectively, in which the Fermi levels are shown by 
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Fig. 7.19. Explanation for the origin of tunnel current. 
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the dotted lines. Figures 7.19(c), (d), and (e) show the relation between 
/„(£) and f p {E),f n {E) -f p (E), and q„(E)q p (E), respectively. From these 
figures, we see that g n (e) Q p {e) { f„(E ) — f p (E)} should look like Fig. 7.19(f). 
Since this has nonzero values only over a narrow range E, the transfer 
probability a can be considered to be a constant over this range, and hence 
the net current is proportional to the area inside the curve. If the applied 
voltage is increased further, the range over which f n {E) — f p (E) does not 
vanish increases. On the other hand, the range over which q„(E)q p (E) has 
nonzero values decreases. When the overlap of two energy bands disappears, 
the area under the curve q„(E)q p (E) {f n (E)-f p (E)} vanishes and the net 
current disappears. When reverse voltage is applied to the junction, 
{/„(£■) —f p (E)} becomes negative, and the range for nonzero Q„(e) q p (E) 
increases rapidly with increasing reverse voltage. Consequently, the net 
current due to the wave interference through the junction varies with the 
applied voltage as shown by the solid line in Fig. 7.20. The dotted line in 
Fig. 7.20 shows the current through the junction due to the mechanism 




Fig. 7.20. Tunnel current versus applied voltage at the junction. 

explained in connection with Fig. 7.18. If we add these two current compo- 
nents, we obtain the I-V characteristic shown in Fig. 7.12. 

Classical particles cannot exist behind a potential wall; however, due to 
their wave nature, electrons can exist behind the wall where classical 
particles are forbidden. This effect is called the tunnel effect since electrons 
can penetrate the potential wall as if there were a tunnel through it. The 
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tunnel diode is named after this effect since it utilizes a similar wave nature 
of electrons. 

In addition to the negative resistance discussed above, the junction of a 
tunnel diode has capacitance as do other types of junctions. Furthermore, 
the junction current flows through the semiconductor regions which act as a 
series resistor. Consequently, an equivalent circuit for the diode becomes the 
parallel connection of the negative resistance and the junction capacitance 
in series with the small positive resistance r. In addition, there might be a 
small series inductance due to the lead wire or whisker connected to the 
diode wafer inside the capsulation. However, it can be included in the external 
lossless circuit, and hence it is not considered here. 

Let us investigate the possible noise sources in the diode. The small 
resistance r representing the semiconductor body is an ordinary resistance 
and has an available noise power kTB when its temperature is T° K. In the 
series representation, the mean square value of this noise voltage is therefore 
given by 

<e 2 > = 4kTBr (7.70) 

In addition, we must consider the shot noise introduced when current flows 
through the junction. Suppose electrons traverse a certain gap which is not 
conductive. As each electron approaches the anode an electric charge is 
induced in the anode which is neutralized by the charge of the electron when 
it arrives. Hence, the current flowing into the anode consists of a large 
number of short pulses as illustrated in Fig. 7.21. The average of this 
current gives the dc component I 0 through the gap. At the same time, a 
noise component is produced since the position of each pulse is entirely 
random ; this noise is large when I Q is large. A statistical discussion, which we 
shall not present here (see Problem 9.5) shows that when the pulse width is 




t 

Fig. 7.21. Explanation of shot noise. 
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sufficiently narrow compared to one cycle of the frequency of interest, the 
mean square value of the noise current is given by 

<i„ 2 > = 2 eI 0 B (7.71) 

This is called the shot noise. The depletion layer at the junction is not 
conductive, and it produces the shot noise given by (7.71). For / 0 , we have 
to add the currents due to electrons flowing from the p- to the ^-region in 
addition to those flowing in the reverse direction. Consequently, I 0 is 
generally larger than the terminal current / which is given by the 1-V 
characteristic of the diode. For most diodes, however, I 0 is only 20-30% 
larger than 1 at ordinary operating points, and, for some good diodes, the 
discrepancy is considerably less. If we include these noise sources in the 
equivalent circuit of the diode, we obtain Fig. 7.22. 




Fig. 7.22. Eqlivalent circuit of a tunnel diode. 



Amplification is possible only when the real part of the impedance 
looking from the terminals is negative. The impedance Z is given by 

z = r+ =r _ _. (OCR 2 

- (1 IR) + jcoC ‘ 1 + (c oCR) 2 J 1 + (coCR) 2 1 ' 

In order to make the real part negative, co must be lower than 

o> c = (C*r‘ {(R-W 2 (7.73) 

where co c is called the cutoff angular frequency beyond which the diode 
becomes a passive element. If R > r, as is usually the case, (7.73) reduces to 

<o c ca C" 1 (Rr )~ 1/2 (R>r) (7.74) 
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In order to estimate the optimum noise measure M opt obtainable with a 
tunnel diode, let us next calculate the open-circuited noise voltage at the 
terminals. Let v n represent the noise voltage due to the shot noise, then we 
have 



K ~ (1 IR) + jcoC 



and hence 



W> = 



1 + (coCRy 



The noise voltage due to the series resistance r and the shot noise voltage 
are independent of each other. The mean square value of the total noise 
voltage <T„> is, therefore, given by 

<K 2 > = <e n 2 ) + <U„ 2 > = 4 kTBr + _ — 2 el 0 B 

1 + ( coCR y 0 

The exchangeable noise power P e is equal to <F„ 2 > divided by four times 
the real part of Z in (7.72), and from the discussion given in Section 7.2, 
M op , becomes 



M opl = 



~Pe = __ O'/) _1 
Tfi 4R eZkTfi 

r ^ R (( 0/(Oc ) 2 

R — r R — r 1 — (c o/a) c y 



which reduces to 



M ~ 

op ‘ 1 - (COM 2 T, T (2km 1 - (c oM 2 

when R > r. 

If we draw a straight line tangential to the I-V characteristic at the 
operating point as shown in Fig. 7.12 and let V t be the intersection with the 
F-axis and V 0 be the bias voltage for the operating point, then the difference 
V\ - V 0 gives IR. Since I 0 is generally 20-30% larger than /, I 0 R will be 
obtained by increasing the value of//? by the same percentage. In addition, 
2 kTJe is approximately equal to 0.05 V when 7] = 290 °K. If the diode is at 
room temperature, we may assume that T~ 7}. Substituting these values in 
(7.76), M opt can be calculated as shown in Fig. 7.23. The abscissa shows cojco c 
and the ordinate M opt times T ( . [F x = 10 log 10 (M + 1); 7; = 290°K]. From 
the curve corresponding to an appropriate value of I 0 R, the best M opc 



(Me) 2 



T i R — r (IkTJe) 1 — (co/to c ) 2 

(7.75) 

I 0 R 1 

; -r-r^z (7-76) 
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tL)/OJ c 

Fig. 7.23. A/opt x Ti versus cojco c with loR as a parameter. 
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obtainable with a given tunnel diode can be readily estimated. To realize f 
M 0 p „ the diode impedance is transformed to an appropriate value Z T 
through a lossless circuit and is then connected to the generator and load 
through a lossless circulator as shown in Fig. 7.24. If the reference imped- 
ance of the circulator at port III is Z 3 and Z T is connected to this port, the | 
gain is given by 

G = |(Z r -Z 3 )/(Z T + Z 3 »)| 2 1 
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as .we discussed in Section 5.8. If a larger gain is desired, the circuit must be 
adjusted to make — Z T close to Z 3 * without exciting possible oscillations. 

In practice, the circulator as well as the impedance transforming circuit 
has some insertion loss; hence, the noise measure realizable with a given 
tunnel diode is always worse than the value calculated as above. The 
circulator insertion loss is usually of the order of 0. 1-0.3 dB, and the noise 
temperature ratio T ap jT t of the amplifier in dB increases by approximately 
the same amount provided the circulator temperature is close to T r On the 
other hand, the transducer gain decreases by twice this amount. Taking 
these factors into account, the noise measure obtainable in practice can be 
estimated from (7.37). 

7.6 Manley-Rowe Relations 

A circuit element which destroys the linear relationship between the 
terminal voltage and current is described as being nonlinear. When a non- 
linear element is lossless, it is called a nonlinear reactance. Suppose voltage 
at angular frequencies o) s and co p are applied to a nonlinear reactance, then 
we expect that harmonics at frequencies expressible in the form |mco s + na> p \ 
may appear, where m and n are positive or negative integers. Let us assume 
that no other frequency components exist and that the two voltage sources 
are independent. Then we have 




(7.78) 
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where P m „ represents the net power into the element at | mco s + nco p \. These 
two relations are called the Manley-Rowe relations, and we shall derive 
them for a nonlinear capacitance using the relation 



v=f(q) 



(7.79) 



where v is the terminal voltage, q is the stored charge, and f(q ) represents a 
single-valued function of q. In the derivation for a nonlinear inductance 
case, we have only to replace (7.79) by i = f{<p), where i is the terminal 
current and (p is the magnetic flux through the inductance. 

By hypothesis, the voltage has nonzero components at \moj s + n<o p \ 
only, i.e., 



” = Z Z A m, n cos (m(D s + nco p ) t + B mn sin (mco s + tuo p ) t 



m = 0 n= — co 






(7.80) 



m = - oo n = - oc 



where 



= V- m ,- n = (A mtn +jB m?n )IJ 2, V 0<0 = J2A 0 ' 0 

The factor in (7.80) is introduced so that V mi „ can be interpreted as the 
effective value of the voltage at mco s + nw p . Note that 

V = V* 

r m, n — m, ~n 

Similarly, the stored charge can be written in the form 



4 = Z Z (Q.M- 2)e 



j(mco 5 + n<o p )t 



m = — go n — — cc 



where the Q m p s are expansion coefficients and 

Qm.n = Q-m,-* 

Let us assume that the infinite series 



(7.81) 






m— — oo n= — oo 



converges uniformly, then the current can be expressed as: 



i = (dq!dt)= Z Z (LM 2 )? 



j(mo)s + noi p )i 



m = — oo n = — go 



where 



L,n=j(m(D t + nco p ) Q m ,„ 
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The average power flowing into the nonlinear capacitance at | mo) s + nco p \ 
is given by 

P m . „ = Re { V m , „/* .} = i(V m . X, n + K X , 0 (7.83) 

From the lossless condition, the total power into the capacitance must be 
equal to zero; 

Z^. = « (7.84) 

m, n 

where the summation is taken to include the power corresponding to each 
frequency only once. To facilitate the following calculation, let us rewrite 
(7.83) as follows: 

Pm,n= W m ,„+ W_ m> (7.85) 

where 

Wm,„ = HV m ,„i:,n+ V* „/„,„) 

= *(»*©. + ™ P )(-jV m , n Q* m ,„ +jV*„Q m J. (7.86) 

Then, (7.84) can be rewritten in the form 



w m , n ~o 



or equivalently, 



m — — co n — - co 



co OC 00 oo 

>. V V — nW ”—+o) P y y -^-.0 ( 7 . 87 ) 

u Li ma) s + Li Li m(a s + na) P 



m = — co n=-oo 



m= — co n = — co 



Since (7.79) holds regardless of the values of a> s and co p , we see from (7.80) 
and (7.81) that each V m n can be considered as a function of the g mM ’s only. 
Then, (7.86) shows that W m J(mco s + n(o p ) is a function of the Q m „’s only 
and is independent of a) s and c o p . Suppose we vary oj s and co p independently 
while keeping the Q mr „'s the same (the external circuit condition will have to 
be changed as a) s and a) p change in order to accomplish this), then from 
(7.87) we see that 



mco s + mo. 



m — — oo n= — oo 



y y — 

U U mco s + n(o p 



(7.88) 



(7.89) 



m — — co n — — co 



Otherwise, (7.87) is not satisfied during the variation. Equation (7.88) is 
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equivalent to 



CO go 

IE 

m— 0 n— — co 



mco, + no r 



u oo 

1 1 

m= — co n- — co 



mw, 4- nro r 



(7.90) 



Changing the signs of m, n and using the relation W m n = W_ m< which 
can easily be derived from (7.86), the second term on the left-hand side 
becomes 



y y mw mtH y y 

L L ma) s + nco p u L 



mco, + nco. 



m = 0 n= - co 



Substituting this into (7.90), we obtain (7.77). Similarly, from (7.89) we 
obtain (7.78). This completes the derivation of the Manley-Rowe relations. 

Let us consider some simple applications of the Manley-Rowe relations. 
Suppose that a large voltage at (o p is applied to a nonlinear reactance, for 
example, to the junction capacitance of a diode, and a small signal voltage 
at co s is superimposed. The large voltage is called the pump voltage, and co p 
the pump (angular) frequency. If the nonlinear reactance is reactively 
terminated at all frequencies except o) p , a) s , and co u = co p + co s , no net power 
flow takes place except at these three frequencies. Rewriting P 0(1 , P 10l , 
and P l t i as P p , P 5 , and P u , respectively, the Manley-Rowe relations become 

(PM) + {Pjw u ) = 0 (7.91) 

(P> p ) + (PJco u ) = 0 (7.92) 



From (7.91), we have 



P u = - P s (o)Jo) s ) 



This equation shows that P u is negative when P s is positive, where P u is the 
power flowing into the nonlinear reactance at o u . This means that the 
external circuit receives a net power at oj u when the signal power is flowing 
into the nonlinear reactance. If co u > co s , the output power at co u can be 
made large compared to the available input power at co s , thus providing 
power amplification. Because of the frequency shift, amplifiers of this type 
are called up-converters. The discussion in Sections 7.1, 7.2 and 7.4 does 
not apply to up-converters since input and output frequencies are different 
in the latter. Combining (7.91) and (7.92), we have 

P p + P s = -P u {(a> P M + ( coJw u )} = - p u 
where a> u = a> p + co s is used. The above equation shows that the output 
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power is made up of the signal and pump powers as expected from the 
lossless assumption. 

Next, suppose that the nonlinear reactance is reactively terminated at all 
frequencies except co p , a> s , and a > t = co p — a> s , then the Manley-Rowe relations 
become 

(PJio,) - ( p,la> ,) = 0 (7.93) 

(P>„) + (PJm,) = 0 (7.94) 

where P_ l t is indicated by P t . If the external circuit is passive at o)„ P t 
must be negative, and hence P s becomes negative also from (7.93). In other 
words, a net power is flowing out from the nonlinear reactance at co s , 
which is given by 

- p s = ~ 

If we adjust the external circuit properly so that \P t \ becomes large, then | P s \ 
will also be large. Therefore, as far as the circuit at co s is concerned, the 
nonlinear reactance is behaving as a negative resistance, and an ampli- 
fier can be built utilizing this property. Such an amplifier is called 
a negative resistance parametric amplifier. For negative resistance para- 
metric amplifiers, the output and input frequencies are identical, and 
all the discussions given in the previous sections are directly applicable. 
The circuit absorbing power at co h which is an essential part of the amplifi- 
cation process, is called the idler circuit, and cu, the idler frequency. This 
name is appropriate since the power at co, is not directly utilized. Combining 
(7.93) and (7.94), we have 

- (p, +p,) = - {(<«>,) + 1} p, = {<»>,)+ 1} p» p ) = p„ 

which indicates that the total power flowing out from the nonlinear reactance 
at cu s and co p is supplied by the pump, as expected. 

7.7 Negative Resistance Parametric Amplifiers 

In this section, we shall study negative resistance parametric amplifiers 
which utilize the variable junction capacitance of a diode. Let v be the 
voltage across the junction capacitance and q be the stored charge. The 
relation between v and q is given by (7.79). Suppose a large voltage v p 
having a fundamental frequency a) p is applied to the junction and the 
corresponding charge is given by q p . Then 



v p = f(<lp) 
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The charge q p is a periodic function of time having a fundamental frequency 
c o p . Suppose a small signal voltage at co s is superimposed which changes the 
junction voltage v p and charge q p to v p + dv and q p + Sq, respectively. Then 



from which we obtain 



v p + Sv = f(q p + Sq) 



Sv J f ^s q 



(7.95) 



(7-96) 



Since q p is a periodic function of time and / is a single-valued function, 
df/dq p is also a periodic function of time having the same fundamental 
frequency co p . Choosing the time origin properly, we can write dfjdq p in the 
form 



dfiQp) 



— 1 cos Q) t -1 cos 2(0 A + 

Kn K t p Kr 



(7.97) 



which is simply the Fourier series expansion of the periodic function. Note 
that K 0 , K u K 2 ,... have the dimension of capacitance. 

The functions Sv and Sq can be expressed as follows: 



Sq-Z' 



where 



v m „ = 



tfm, n Q -m, —n 



(7.98) 

(7.99) 

(7.100) 



and indicates the summation with respect to m and n from — oo to + oo, 
excluding those terms for which m = 0. The junction current Si corre- 
sponding to is given by 



j(ma>, s + (i<tf p )f 



where 



(L.JyP)e 



i m ,n=j(™u>s+ n(0 p) Qm, n 



(7.101) 

(7.102) 



Suppose that the external impedance is properly adjusted so that the 
current can be neglected at all frequencies expressible in the form \mco s + na) p \ 
except co s and co, = co p — o) s . Then only four of the < 7 m> „’s (m / 0) remain 
nonvanishing: q l 0 , q- 1>0 , and Because of (7.100), only two 

of them are independent so let us try to write the voltages at w s and cn, in 
terms of these independent variables. If we substitute (7.97), (7.98), and 
(7.99) into (7.96) and compare the terms with exp{ja) s /} on both sides, we 






7.7. Negative Resistance Parametric Amplifiers 



339 



obtain 

v u o — 0/^o) Q i, o + (1/2/Ci ) q lt _ j 
Similarly, comparing the terms with exp (y(to s - co p ) t }, we have 

v t,-i=^/K 0 )q 1 .- i +(\/2K 1 )q lt0 
Using (7.102), the above result can be rewritten as follows: 



v s = I, — i, 

ja) s K 0 jco,(2K } ) 

* 1 .* 1 
v, = - if + — — »• 
j(o,K 0 j(o s (2K l ) 



where u 1>0 and / 1>0 are indicated by v s and i s since they are the voltage and 
current at oj s . Similarly, n_ 1(1 and /_ ia are indicated by v, and i„ respectively. 
In addition, there may be nonzero voltages at other frequencies. However, 
since the corresponding currents are zero by hypothesis, no net power 
transfer takes place, and hence we do not bother with them. 

From (7.103) and (7.104), we see that an equivalent circuit of the junction 
is given by Fig. 7.25, in which the c o s and co p circuits are drawn separately 




Fig. 7.25. Equivalent circuit of a parametric junction. 



and \/j(D s K 0 and 1 /jca,K 0 are the impedances due to the series capacitance 
K 0 which play no essential role in the parametric amplification. The terms 
containing 2K i which are drawn in the form of voltage sources are respon- 
sible for the amplification. Figure 7.25 represents the diode junction only, 
but in practice the junction current flows through the semiconductor part 
which acts as a small^ but finite, series resistance r. To take its effect into 
account let us define Q s and Q t by 

& = (1 /a>£Kir), Q, = (1/ft), 2^ r) 



(7.105) 
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respectively. Remember that the Q of an ordinary capacitance C is defined 
by 1 /coCr, and that the 0’s defined above have 1K X in place of C, where 
2 K y represents a capacitance component changing with time, it is therefore 
logical to call § s and Q, the dynamic 0 of the variable capacitance at a> s 
and w h respectively. Suppose that a load impedance is connected to the 
co, circuit and let Z, represent the total impedance due to 1 fjoj,K 0 and the 
load impedance in series, then the co, circuit becomes the series connection of 



r, Z h a voltage source 




and a possible noise voltage v„ as shown in Fig. 7.26. If we assume that both 




Fig. 7.26. Equivalent idler circuit. 



r and R t = Re { Z ,} are at temperature T and that v„ represents the thermal 
noise from these resistances, 0„ 2 > is given by 

<v n 2 y = 4kTB{r + R l ) (7.106) 



From Fig. 7.26, /, is calculated to be 

h = - (r + Z,)' 1 v„ -jQ s r(r + Z,)“ 1 i* 

The equivalent voltage source in the co s circuit shown in Fig. 7.25 is, therefore, 
given by 



;'co,2Xi 



= j&ri? =~j&r(r + Z/V *.* ~ M/O’ + Z,*)" 1 i s 



(7.107) 



The second term on the right-hand side of (7.107) expresses the voltage 
developed across the impedance — 2i2/ 2 ( r + Z,*) _1 when i s flows. Con- 
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sequently, the equivalent circuit looking into the diode at co s becomes as 
shown in Fig. 7.27, where <e n 2 > is given by 

<e fl 2 > = 4 kTBr (7.108) 

In order to obtain amplification, the real part of the impedance must be 
negative : 

Mr + *)-'}<() (7.109) 

l_ 



K K o 




Fig. 7.27. Equivalent circuit of the diode at an. 

The second term in the bracket becomes maximum when Z* is equal to zero 
Setting Z,* = 0, (7.109) reduces to 

'•(l~2/2s)<0 (7.110) 

If cu, and w s become large and Q,Q 5 becomes less than unity, (7.110), and 
hence (7.109), cannot be satisfied and no amplification is expected. 

Let us next calculate M opl . Since e n and v n are independent, the mean square 
value of the open-circuit noise voltage V n is given by 

<^ 2 > = <^> + < y „ 2 >|yg,A-| 2 |r + z I |- 2 

Substituting (7.107) and (7.108) and writing Z, in the form R t + jX h we 
have 

<F n 2 > = AkTBr + 4kTB(r + R ,) &V {(r + Rf + X, 2 }' 1 (7.111) 

The exchangeable noise power P e is given by <F„ 2 ) divided by four times 
the real part of the equivalent impedance of the diode shown in Fig. 7.27. 
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Since the real part is given by 



we have 



r ~ Q&r 2 (r + R,) {(r + R,) 2 + X, 2 }- 1 

_~P, T g,V(r + «, ) + (r + K,) 2 + ;y , 2 
° pl kT t B T i M,r(r + R,)-{(r + R,f + X 2 } 



(7.112) 



This is the M opt for fixed R t and X t . If X, is allowed to change, the M opl 
reaches a minimum when X 2 — 0, since the numerator increases and the 
denominator decreases as X 2 increases from zero. This minimum value of 
K P t is given by 

T g, 2 r + (r + K,) 

OP ‘ T, M,r ~(r + R t ) 

Furthermore, if R t is allowed to change, the best value of M opt is attained 
when R t = 0 since the numerator increases and the denominator decreases 
as R t increases from zero. The best M opl is given by 



M opt = 



T Q 2 + 1 

Tt Mi ~ 1 



which is obtainable when, and only when, 

Ri = 0, X, = 0 



(7.113) 



(7.114) 



or equivalently, when the external idler circuit is an inductance which just 
cancels \(jo)[K 0 . As in the case of tunnel diode amplifiers, M op , can be 
realized by connecting the generator and load to the diode through a circu- 
lator. The gain can be adjusted to a desired value, by inserting a proper 
lossless circuit between the circulator and the diode. 

If T = T ( , Eq. (7.1 13) is a function of Q s and Q t only. Since Q , is deter- 
mined from Q s and a),/c o s , M opt x T x can be plotted as a function of Q s with 
(dJcOs as a parameter. Figure 7.28 shows such a plot. From this, we see that, 
for fixed Q s , there is an optimum value of co,/a> s which gives the minimum 



Note that T is the diode temperature, and that a further improvement of 
M op( is expected if the diode is cooled. For example, the value of M opl x T, 
becomes 77.2/290 times the value shown in Fig. 7.28 when the diode is 
cooled to the temperature of liquid nitrogen, and 4.2/290 times when 
cooled to the temperature of liquid helium. It should be pointed out that 
Q s of a diode is a function of temperature, and hence the room temperature 
value must not be used in the refrigerated cases. Furthermore, it should be 
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2 4 6 8 10 20 40 60 80 100 



Fig. 7.28. A/opt x Ti versus Q s with coi/io s as a parameter. 

pointed out that noise originating in the circulator may predominate in 
such a low noise application unless the circulator is also cooled. 

PROBLEMS 

7.1 Obtain the canonical form of a triode assuming the equivalent circuit as shown in 
Fig. 7.29. 

7.2 Obtain the condition under which the triode can provide a positive resistance with 
the noise temperature below T. 




Fig. 7.29. Equivalent circuit of a triode. 
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7.3 Calculate the available and transducer gains of the amplifier shown in Fig. 7.30, 
assuming R g = Rl = 50 Q and R = 52 Q. Note : A tunnel diode with negligible 

series resistance r can be represented by the circuit inside the dotted line. -k 

7.4 Suppose that an amplifier A has 50 Q input and output impedances, and let G be the 

transducer gain when the generator and load impedances are (50 + jO) £2. Inserting I 

the amplifier inside the dotted line in Fig. 7.30 in front of amplifier A, calculate the 

overall available and transducer gains of the cascaded amplifier. Also, calculate the , ; 

available and transducer gains of each component amplifier in the cascade connection. 




Fig. 7.30. An amplifier for which M e = M op t but M is poor. 



7.5 The exchangeable gain G e of an amplifier is defined by the ratio of the exchangeable 
signal power at the output port to the available signal power from the generator, 
provided that the generator impedance has a positive real part. The exchangeable /$ 

noise figure F e is the ratio of the exchangeable noise power at the output port to ;/ 

kTiBGe, when the noise temperature Tt of the generator is specified to be 290 °K. J*' 

The exchangeable noise measure M e is defined by W 



F e ~ 1 

Me = 

1 - (1 IG e ) 

Calculate the possible range of Me following the discussion for M given in Section 
7.2, and show that the smallest positive value of M e is equal to M op t. Also show that 
M > M e whenever M is positive. This means that the evaluation of amplifier noise 
performance by M is generally more critical than that obtained from M e . 




7.6 Calculate M e and M for the amplifier shown in Fig. 7.30, and show that Me of this 
amplifier is always equal to M op \, regardless of the values of R g , Rl, R, and C. 
Although this type of amplifier can give a large available gain over a wide bandwith 
and the noise performance expressed by M e is optimum, it is generally not considered 
to be a good practical amplifier because of its low transducer gain and poor M. 

7.7 Calculate the operating noise temperature of a parametric amplifier assuming the 
output power is taken at the idler frequency. 

7.8 Prove that Q is invariant to nonsingular, lossless transformations. 
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7.9 Let Yi and yi be the eigenvalues of Z + Z + of a two-port network, and prove that 

r = Oh/N) + (w/M) (7.115) 

is invariant to nonsingular, lossless, reciprocal transformations. 

7.10 Prove that the two circuits shown in Fig. 7.31 have the same unilateral gain U but 
that no lossless reciprocal network can transform one to the other. Note-. For these 




Fig. 7.31. Two circuits which have the same unilateral gain but cannot be transformed 
from one to the other by any lossless reciprocal network. 

two circuits, the values of (7.1 15) are different. Consequently, the unilateral gain L/ is 
not a unique invariant to nonsingular lossless reciprocal transformations. 

7.11 Prove that the condition under which both input and output ports of a two-port 
network (IS 12 S 12 I ^ 0) can be matched simultaneously keeping the real parts of the 
source and load impedances positive is given by 

+ | * 5 * 22 ! ^ + 2 IS 12 S 21 I < 1 -f \S12S21 — SinSj#! 2 
When IS 12 S 19 I = 0, the same condition is given by 

l-S’ii] < I , I.S22I < 1 
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CHAPTER 8 

ELECTRON BEAMS 



A stream of electrons in free space is called an electron beam. It plays an 
essential role in most microwave electron tubes such as klystrons and 
traveling wave tubes. In this chapter we shall develop a small signal theory 
for a simplified electron beam model in order to gain physical insight into 
the behavior of the actual beam. It must be pointed out, however, that the 
small signal assumption imposes a rather stringent restriction on the 
applicability of the theory. For example, we cannot use the present discussion 
to estimate some of the important parameters of microwave tubes, such as 
efficiency and maximum output power. 

We shall first construct the small signal model of electron beams and 
study the properties of possible waves propagating along the model. Next, 
the interaction between the electron beam and electrode gaps, as well as the 
continuous interaction with a slow wave structure, will be discussed in 
detail together with an explanation of the signal amplification process in 
microwave tubes. The last section is devoted to a discussion of amplifier 
noise performance that can be expected with a given electron beam. 




8.1 An Electron Beam Model t 

$ 

To construct a simple yet useful electron beam model let us assume that 
the electrons in the beam move in the z direction only. The transverse * 
motion can be restricted, for instance, by applying a high static magnetic | 

field in the z direction. | 

Corresponding to the electron motion in the z direction, there is current | 
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/ z( r > Os where r represents position (x,y,z). Maxwell’s equations then 
become 

VxE(r,/) = -/i~H (r, t) ( 8 . 1 ) 

V x H(r, t) = k i 2 (r, t) + g - E (r, /) (8.2) 

Electrons in the beam are accelerated by the electric field E z (r,t). The 
equation of motion is given by 

7t V(r - ') = - e m E =( r ’‘) ( 8 - 3 ) 

where m is the electron mass 9.108 x 1(T 31 kg, e is the magnitude of electron 
charge 1.602 x 10 19 coulomb, t>(r, 0 is the electron velocity, and djdt 
indicates the total derivative with respect to time. Let d/dt indicate the partial 
derivative with respect to time; i.e., the derivative taken at a fixed point r 
as usual, then we have 

d 8 dz 8 

Substituting the above equation into (8.3) gives 

8 d e 

- v (r, t) + V (r, 0 - V (r, t) = - - £. (r, t) (8.4) 

where dzjdt = t?(r, t) is used. 

The equation of continuity, expressing the conservation of charge, is given 
by 

— | ( 8 - 5 > 

The current density is equal to the product of charge density and electron 
velocity, i.e., 

L ( r > t) = v (r, t) q (r, t) (8.6) 

All the quantities introduced above may have two distinct components, 
the stationary and alternating components. Since the alternating components 
are sufficiently small under the small signal assumption, the product of two 
alternating components can be neglected compared to the product of 
corresponding stationary and alternating components. A consistent model 
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(8.10) and (8.11), we have 

V*E(r) x H*(r) = — jcoftH (r) • H* (r) + jcos E (r) • E* (r) - E z (r) i* (r) 

(8.18) 

Using (8.13) and (8.17), the last term on the right-hand side can be written 
in the form 

E z ( r ) h* ( r ) = - ~ ji^o (r) v (r) v* (r) 

+ jov(r) v Q (r) q * (r) + i* ( r ) ~v 0 (r) i>(r)j (8.19) 
The second term in the bracket is equal to 



y 0 (r)y(r) 



dz 



as is easily seen from (8.15). This can be combined with the last term in the 
bracket of (8.19) to give 

^U 0 (r)y(r) i z *(r) 



Substituting (8.19) and this result into (8.18), and remembering that k-\ 
is the partial derivative with respect to z, we obtain 

V • (E(r) x H* (r) - (m/e) ke 0 (r) v (r) i* (r)} 

= jco (eE (r) -E* (r) - /iH (r) • H* (r) + (m/e) q 0 (r) v (r) v* (r)} 

Let us integrate this equation over a volume V containing the electron beam 
as shown in Fig. 8.1. The left-hand side can be converted to a surface 
integral over S resulting in 

J {E (r) x H* (r) - (m/e) ku 0 (r) v (r) i* (r)} • n dS 

= j(o J {fiE(r)‘E*(r) - /jH(r)*H*(r) + (m/e) Q 0 (r) v(r) u*(r)} dv 

( 8 . 20 ) 

Since the right-hand side is pure imaginary, the real part of (8.20) gives 

Re j E(r) x H*(r) *n dS + Re f (— l)(m/e) ku 0 (r) u(r) i z *(r) *11 dS = 0 

(8.21) 
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ELECTRON 

BEAM 



Fig. 8.1. Region of integration. 



The first term on the left-hand side is the electromagnetic power P flowing 
out from the volume V through the surface S; 



P = Re J E(r) x H*(r)-n dS 



( 8 . 22 ) 



The second term on the left-hand side of (8.21) is related to the motion of 
electrons. Since (8.21) shows that the sum of these two terms is always 
equal to zero, let us interpret the second term as the kinetic power P k 
carried out by the electrons through S. With this interpretation, (8.21) 
indicates the conservation of energy: The electromagnetic power flowing 
out from a volume V is equal to the kinetic power flowing into the same 
volume. Power is expressed ordinarily as the product of voltage and current. 
If we define the kinetic voltage of the electron beam at r by 



V k (r ) = -(m/«) v 0 (t)v(r) 



then the kinetic power is given by 



Pk = Re J v k (r) i* (r) k • n dS 



(8.23) 



(8.24) 



Just as we interpret E(r)xH*(r) as the electromagnetic power density, 
the kinetic voltage times electron current density V k (r) i 2 *(r) can be con- 
sidered as the kinetic power density of the electron beam at r. 

The first two terms on the right-hand side of (8.20), 

J eE (r) • E* (r) dv , J p H (r) • H* (r) dv 

represent twice the average stored electric and magnetic energies, respec- 
tively. The third term 



J (m/e) q 0 (t) v(r) u*(r) dv 
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can be intepreted as twice the average stored kinetic energy. Since e is 
positive and g 0 (r) is negative, expressing the electron charge density, this 
term has the same sign as the stored magnetic energy in (8.20). 

8.2 Space Charge Waves 

Following the discussion of waveguides in Chapter 3, let us assume that 
the microwave components of various quantities vary with distance 
exponentially; i.e., 

E« = (E, + k£ z ) H(r) - (H t + k H z ) e~ y * 

*z( r ) = i,e~ vz , v(t) = ve~ yz , q(t) = Qe ~ yz (8,25) 

where E f , E z , H„ H,, i z , v, and q are independent of z but may be functions 
of transverse position. We further assume that the stationary component 
of electron velocity has the same value everywhere over the cross section of 
the electron beam. This assumption is approximately satisfied after the 
electrons have been accelerated by a high static voltage. Furthermore, let 
us assume that the stationary component of the electron velocity does not 
depend on z; i.e., no static electric field is applied in the region of interest. 



Then, (8.13) becomes 

jwv - yv 0 v = - (e/m) E z (8.26) 

Similarly, from (8.15) and (8.17) we have 

- yi z = -Jwq (8.27) 

h = v oQ + VQo (8.28) 

From (8.26), v is given by 

v = ~ (e/m) (jco - yv 0 )~ 1 E z (8.29) 

By solving (8.27) and (8.28) q can be expressed in terms of v. Substituting 
this result into (8.29), we obtain 

e = JQo (j<» ~ yv 0 Y 1 v = - y Qo (e/m) (ja) - yv 0 )~ 2 E z (8.30) 

Finally, by substituting (8.30) into (8.27), we obtain i z in terms of E z : 

i z = ~jo)Q 0 (e/m)(ja) - yv 0 )~ 2 E z (8.31) 



If E z is equal to zero, i z is also zero from (8.31), and Maxwell’s equations 
(8.10) and (8.11) assume the same forms as in the case in which there is no 
electron beam. This means that TE waves propagate independently of the 




352 



8. ELECTRON BEAMS 



8.2. Space Charge Waves 



353 



electron beam. To study the effect of the electron beam, however, we 
concentrate on the case in which E z is not equal to zero. 

Beginning with (8.10) and (8.11) in place of (3.52) and (3.53), and pro- 
ceeding as we did in Section 3.4, we find that (3.58), (3.59), (3.60), and 
(3.62) are also valid for the present discussion without modification. On 
the other hand, (3.57) and (3.61) are replaced by 

V x H f = jcoe {1 + (o p 2 (/co - yu 0 ) -2 } *-E z (8.32) 

and 

V • E, = {1 + (o p 2 (jco - yv 0 )- 1 } yE z (8.33) 

respectively, where use is made of (8.31) and 

(0 P 2 = ~ teo/e) (elm) (8.34) 

Taking V x (3.60) and rearranging the left-hand side, we have 
yk V • E t + k V • \E Z = j(op VxH, 

Substituting (8.32) and (8.33) into this expression gives 

V • V£ z + (c o 2 ep + y 2 ) {1 + (o 2 (j«> - yv 0 )~ 2 } E z = 0 (8.35) 

A perfect conductor wall gives the boundary condition E z = 0 independent 
of H z . In a similar manner, suppose that a boundary condition for E z is 
given independent of H z , then E z can be obtained which satisfies (8.35) 
under the boundary condition while assuming H z = 0. Once E z is obtained, 
H, and E, are given by 

H, = - j(oe(co 2 ep + y 2 ) - 1 k x VE Z (8.36) 

E t = -y((o 2 ep + y 2 y l \E z (8.37) 

where (3.58) and (3.60) have been used. Furthermore, v, q, and i z can be 
calculated from (8.29), (8.30), and (8.31), respectively. The problem is thus 
reduced to the solution of (8.35) under the appropriate boundary condition. 
It is difficult to give a general discussion of the eigenvalue problem repre- 
sented by (8.35). Hence, we shall discuss a few particularly simple examples 
and imply what kinds of solutions may exist in more general cases. 

First, let us consider a one-dimensional model. There are no field variations 
in the x and y directions, and the first term on the left-hand side of (8.35) 
becomes zero thereby making the second term vanish. If we assume that 
v 0 is equal to zero, the nontrivial solution E z can exist when co — (o p . This 
solution represents a free-running oscillation due to the repulsion force 
between electronic charge and the inertia of electrons. Since free electrons 




in a plasma can oscillate in this way, co p is called the plasma angular fre- 
quency. 

For the electron beam, v 0 is not equal to zero. In order to obtain non- 
trivial solutions from (8.35), either 

(o 2 ep + y 2 = 0 (8.38) 

or 

(1 + (o P 2 (jco~yv 0 y 2 } = 0 (8.39) 

has to be satisfied. From (8.32) and the one-dimensional assumption, E . is 
equal to zero unless (8.39) is satisfied. Therefore, we have only to investigate 
(8.39). There are two different values of y which satisfy (8.39): 

V+ —j((o — (o p )/v 0 (8.40) 

y_ =j((o + (o p )fv 0 (8.41) 

This means that there are two different waves, one slightly faster and the 
other slightly slower than v 0 , corresponding to y + and y_, respectively; 
they are called the fast and slow waves. 

Since \E Z =0 for the one-dimensional model, both E, and H, are equal 
to zero, and no electromagnetic power is transmitted. Substituting (8.40) 
and (8.41) into (8.31), we have 

i z = -j(oeE z (8.42) 

for both fast and slow waves. This indicates that the electronic and dis- 
placement currents cancel each other and explains why H f becomes zero. 

When (8.38) is satisfied, even if E z is equal to zero, E, and H ( may have 
nonzero magnitudes, corresponding to the plane wave in free space. 

Let us next consider an electron beam moving in the axial direction, 
which fills the inside of a waveguide. We assume that co p 2 is constant over the 
waveguide cross section and solve (8.35) under the boundary condition E z = 0. 
From the discussion of waveguides, the eigenvalue problem 

\ 2 E Z + k 2 E z = 0 (in S) 

E z = 0 (on L) 

has an infinite number of solutions, where S indicates the waveguide cross 
section and L the boundary. Let E zn be the nth solution and k 2 the corre- 
sponding eigenvalue. Then E zn itself serves as a solution for (8.35) and the 
corresponding y is obtained from 

K 2 = (co 2 ep + y 2 ) {1 + (o p 2 ( j(o - yy 0 )~ 2 } 



(8.43) 



354 



8. ELECTRON BEAMS 



This is a fourth-order algebraic equation which gives four values of y. Of 
these four values, we are interested in those which have a strong interaction 
with the electron beam. Since their velocities must be approximately equal 
to that of the electrons, we assume 

y±~j(a) + a> q )lv 0 (8.44) 

where ao co q . Substituting into (8.43), we then have 

k„ 2 ~ - (a)/v 0 ) 2 (1 - (o) p lco q ) 2 } 

where co 2 ep is neglected compared to y 2 ^ - (co/v 0 ) 2 since (o 2 ep is equal to 
co 2 divided by the square of the velocity of light, and the electron velocity v 0 
is far smaller than the velocity of light in ordinary cases. From the above 
approximate equation, co 2 is given by 

oj 2 * co 2 {{t 2 !{p 2 + k 2 )} (8.45) 

where 

Pe = (co/Vo) (8.46) 

The phase constant p e corresponds to the phase velocity v 0 . 

Since k 2 is positive, from (8.45), we have 

<o p 2 > CO 2 (8.47) 

A comparison of (8.44) with (8.40) and (8.41) shows that co q plays the same 
role as co p did in the one-dimensional model. For these reasons, co q is called 
the reduced plasma angular frequency. 

We assumed co P oj q in order to derive (8.45). This assumption does not 
contradict (8.47) since q 0 is usually small and co > co p . 

It is worth noting that since E, has the same functional form both for 
the fast and the slow waves, their E f and H, also have the same functional 
forms. However, since their y’s are different, the coefficients in (8.36) and 
(8.37) have different magnitudes for the two cases. On the other hand, i z 
is given by 

i 2 = - jcoe {co 2 !(o 2 ) E z (8.48) 

for both cases. Because of (8.47), the electronic current is larger than the 
displacement current and their difference produces nonzero H, in contrast 
to the one-dimensional model. 

Finally, let us consider a more general case in which the electron beam 
partially fills the waveguide cross section. Since co 2 is now a function of 
transverse position in the waveguide, it is difficult to solve (8.35) unless some 
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symmetric configuration is assumed. If the waveguide and the electron 
beam have concentric circular cross sections and co 2 is constant over the 
electron beam cross section, then the equation can be solved independently 
for the regions inside and outside of the electron beam using Bessel functions. 
These solutions can then be connected to give eigenfunctions which satisfy the 
continuity of E z and \E Z at the interface. The continuity of \E Z is necessary 
to ensure the continuity of E,. This analysis (Problem 8.4) shows that there 
is a pair of propagation constants similar to (8.44) which implies fast and 
slow waves for each eigenfunction. 

Judging from these examples, the fast and slow waves appear to exist in 
general, and in the remainder of this section we shall assume their existance. 
They are called space charge waves since the charge of electrons in space is 
responsible for the wave phenomena. 

Let us investigate the relationship between the electromagnetic power P 
and kinetic power P k of space charge waves. When the electronic and dis- 
placement currents cancel each other, as in the one-dimensional model, P 
must be zero since no magnitec field exists. In this extreme case, we have 

(-P/Jfe) = 0 (8.49) 

In the other extreme case in which the displacement current is negligibly 
small compared to the electronic current, (8.11) gives 

v x H, ~ ki 2 (8.50) 

Neglecting co 2 ep compared to y 2 , (8.37) becomes 

Ef — — y ” 1 V£ 2 (8.51) 

Integrating E(r)x H*(r) over the waveguide cross section, we have 



J E(r) x H*(r)-kdS = J E t x H ( * -k dS ~ — y" 1 j (VE Z x H/*).kdS 
= -y _1 Jv x £ 2 H f *.kdS + y _1 j E z \ x 



(8.52) 



where use is made of (8.51). The first term on the right-hand side of (8.52) 
becomes a line integral around the inside surface of the waveguide wall by 
Stokes’s theorem and vanishes because E,_ = 0. The integrand in the second 
integral on the right-hand side becomes the product of E z and i* from 
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(8.50). Thus, the electromagnetic power P is given by 



P ~ Re 




(8.53) 



which is independent of z. The kinetic power is calculated from (8.23) and 
(8.24). In the present case, v(r) in the kinetic voltage is given by (8.29) times 
exp(— yz), i g *( r) by i* exp(yz), while v 0 (r) is independent of z. Substituting 
these into (8.24), we have 

P k = Re ju 0 (jco - yv o y 1 J E z i* Jsj (8.54) 



which is also independent of z. The integrals in (8.53) and (8.54) are both 
pure imaginary as is easily seen from (8.31), and the insides of the brackets 
in both equations become real. The notation Re in (8.53) and (8.54) can 
therefore be eliminated which makes the ratio between P and P k equal to 



(P/Pk) - 1~' 



Substituting (8.44), this becomes 

(PlP k )*{±a> q l(coJa> q )} (8.55) 

where the upper and lower signs apply to the fast and slow waves, respectively. 
Since co is much greater than co q , (8.55) shows that power is mostly trans- 
mitted in the form of kinetic power, even in this extreme case with negligible 
displacement current. 

fn more general cases in which the electronic current is partially canceled 
by the displacement current, H, becomes correspondingly smaller and the 
ratio of P to P k will take some value between (8.49) and (8.55). We conclude 
from this that electromagnetic power is always small compared to the 
kinetic power if co>o) p > w q , or equivalently, if q 0 is small. When this is 
the case, the effect of the waveguide walls on the space charge waves is 
expected to be small, whether or not the walls are close to the electron beam. 
The majority of the transmission power is confined in the electron beam in 
the form of the kinetic power, and the electromagnetic field, which receives 
the constraint from the walls, only carries a small portion of the total power. 

As we discussed earlier, there are a number of solutions for (8.35); and for 
each eigenfunction E z , there are a pair of waves having propagation con- 
stants in the form of (8.44). Since E z is different for different pairs, some 
pairs are strongly excited through external circuits while the others are not. 
In the case of a thin electron beam, the excitation of only one pair can be 
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made strong, and these generally have almost constant E z over the beam 
cross section. For simplicity, let us concentrate on such a pair and neglect 
the effects of all other pairs. Since the current density of each wave is also 
almost constant over the beam cross section, the electronic current is given by 
the current density times the cross sectional area S 0 . Let J + and J _ be the 
electronic currents of the fast and slow waves at z = 0. Then the total 
current at arbitrary z is given by 

J{z) = J + e~ y * z + J_e~ y ~ z = (J + e j(, « z + J_ e - Jf * g ) e~ j(l * z (8.56) 

where 

P q = (<oJv 0 ) (8.57) 

On the other hand, from (8.23), (8.29), (8.31), and (8.44), the kinetic voltage 
is calculated to be 

V k (z) = Z 0 (J + e"«* - J_e~ j ^ z ) e~ j ^ z (8.58) 

where 



c q 0 o) S 0 J 0 



(8-59) 



The stationary component of the electron beam current is expressed by 
J 0 , and V 0 is the voltage applied to the electrons before they enter the 
region under consideration; hence, \mv 0 2 = eV 0 . 

The factor exp (~jfl e z) in (8.56) and (8.58) shows that the wave pattern 
as a whole moves in the positive z direction with the electron velocity v 0 . 
Except for this factor, V k (z) and J(z) have functional forms similar to the 
voltage and current along a conventional transmission line having a character- 
istic impedance Z 0 and phase constant ft,. If we define the reflection 



coefficient r by 




r = - (J_/y + ) e~ 2j ^ z 


(8.60) 


the impedance at z is given by 




~ K{ z ) _. 1 + r 




JV-) ^ °l-r 


(8.61) 


and the relation between r and Z can be studied 
Since 


using the Smith chart. 


|F*(z)|=Z 0 |J + ||I+r| 


(8.62) 


|J(ar)| = |J + ||l-r| 


(8.63) 



we can easily see on the Smith chart how the magnitudes of V k (z) and 
J(z) vary with z. 
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Let us consider the velocity of energy carried by each wave at angular 
frequency (o. Since the system is lossless, the velocity of energy is equal to 
the group velocity 



dco 

’■"sr 



dP 

da) 






- 1 



0 \ 1 ± 



dco q 

do 



(8.64) 



The upper and lower signs apply to the fast and slow waves, respectively. 
Usually, (» q is a slowly varying function of to, and the energy propagates 
with a velocity close to v 0 and in the same direction as the electron motion. 
The small discrepancy between the electron and energy velocities exists 
because a small portion of energy is transmitted in the form of electro- 
magnetic power, as discussed previously. Note that v g is equal to t> 0 for the 
one-dimensional model in which the electromagnetic power disappears. 

The kinetic powers of the fast and slow waves are calculated from 
appropriate terms in (8.56) and (8.58) to be 

P k = Z 0 \J,\ 2 and P k = -Z 0 \J.\ 2 



respectively. In the conventional transmission line, a negative power appears 
when the wave propagates in the negative z direction. The slow wave has, 
however, a negative power while propagating in the positive z direction. 
The following interpretation is usually given to explain the origin of the 





Fig. 8.2. The relation between electron velocity v and charge density |<?|: (a) Fast 
wave; (b) Slow wave. (Note that q is negative.) 



si 



negative power. We obtain from (8.29), (8.30), and (8.44), 

( v Iq) = Weo) { ± 0> q l(a> + <Oq)} (8.65) 

where the upper signs apply to the fast wave and the lower signs to the slow 
wave. Equation (8.65) shows that if we observe the electron beam at a fixed 
point z, the electron velocity and the charge density vary with time, as 
illustrated in Fig. 8.2. For the fast wave, the velocity increases when the 
magnitude of the charge density, and hence the electron density, increases. 
Since most of the electrons increase their kinetic energy in this way, the kinetic 
energy of the electrons on the average increases when the fast wave is excited. 
For the slow wave, the lectron density and the velocity are out of phase and 
consequently the average kinetic energy decreases when the slow wave is 
excited. This decrease of kinetic energy compared to the unexcited state 
propagates with the electrons in the positive z direction and is referred to 
as the negative kinetic power of the slow wave. 

8.3 Gap Interactions 

Let us consider two closely spaced planer grids which are capable of 
exciting a pair of space-charge waves for which E z is almost constant over 
the electron beam cross section as discussed in the previous section. The 
planes of the grids are perpendicular to the beam axis, and the gap between 
them is narrow compared to 2nlp e . Furthermore, it is assumed that the 
grid wires are sufficiently thin and the meshes sufficiently large that no 
electrons are intercepted, and yet a perfect electro-static shield is provided 
by the grids. Under these assumptions, let us calculate the electronic current 
and kinetic voltage at the exit from the gap, assuming their values at the 
entrance are given. Let z = - / and z = 0 be the locations of the first and 
second grids and V be the microwave voltage applied to the gap, as shown 
in Fig. 8.3. Since all the equations are linearized under the small signal 
assumption and the principle of superposition holds, the following two cases 
can be discussed separately : 

(i) The alternating voltage V is applied to the gap while keeping the 
current density i z (—l ) and alternating electron velocity u( — /) at 
the entrance equal to zero ; 

(ii) V is equal to zero, but i z (— l ) and v(— l ) have given values. 

The superposition of these two cases gives the desired solution. 

In case (i), the applied gap voltage V either accelerates or decelerates the 
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electrons depending on the phase of the voltage. The accelerated electrons 
tend to catch up the decelerated ones resulting in a modulation of charge 
density which in turn produces some E z in addition to the original gap field. 
If the gap is narrow, as assumed here, the variation of charge density within 
the gap is negligible since the decelerated electrons leave the gap before the 
accelerated ones catch up. Under this assumption, the field E z is equal to the 
applied voltage divided by the gap length /. The equation of motion for each 
electron is given by 

d e /_ v\ 

— v(z,t) = — — yjl cos ( at (8.66) 

at m\ l ) 



Since the electron velocity is approximately equal to v 0 , the electron velocity 
at z = 0 is obtained by adding the initial velocity v 0 at z = — / to the integral 
of the above quantity with respect to t from t — l/v 0 to t : 



where 



“ e V - 

- v 0 + — -J 2 cos cot dt 

Jt-ltv o fn 1 

e V sin (0/2) „ 

= v 0 + ——I y/2 cos (tot - i0) 

m v 0 (0/2) 

0 - ( (o/v 0 ) / = p e l 



( 8 . 68 ) 



The term 0 is called the transit angle of the gap. If we replace yjl V cos cot 
by the corresponding complex expression Ve jmt , the alternating component 
of the velocity can be expressed in the form 



»( 0 ) = 



e V sin (0/2) 
m v 0 (0/2) 



e -JW2) 
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From this, the kinetic voltage V k defined by (8.23) is given by 

V k (0) = MV 

where 

M = _sin(0/2 ) 

(0/2) 



(8.69) 

(8.70) 



The term M is called the beam-coupling factor. When / is sufficiently small, 
M is equal to - I which means that the kinetic voltage at the exit of a narrow 
gap is equal to the applied gap voltage, except for the opposite sign. The 
corresponding / 2 (0) can be calculated by integrating (8.15). Since the alter- 
nating component q of charge density is equal to zero from the discussion 
preceding (8.66), and i 2 ( — /) = 0 by hypothesis, we have 



'r (0) = 0 



(8.71) 



hence the electronic current J at z = 0 is equal to zero. This completes the 
discussion of case (i). 

In case (ii), the gap exerts no force on the motion of electrons. Because 
of the phase difference due to the transit time, we multiply v{- l) and 
h(— 0 by e~ ]0 to obtain i?(0) and f' z (0), respectively: 



v(0) = v(-l)e je (8.72) 

*z(0) = i z (- l)e- j0 (8.73) 



Note that p q l is neglected compared to p e l= 0, as comparison with (8.56) 
or (8.58) may indicate. 

In general, u(0) and i z (0) are given by the superposition of the above 
two cases; in terms of the kinetic voltage and electronic current, their values 
are 

V k { 0) = V k (~ /) e~ j0 + MV (8.74) 

J(0) = J(-l)e~ jd (8.75) 



So far, we have described how the electron beam is modulated by the gap 
voltage. Let us next consider the effect of the electron beam on the current 
flowing into the gap through the external circuit. To do so, we use the power 
relation (8.20) with the surface S' enclosing the gap space. The first term on 
the left-hand side is the electromagnetic power flowing out through the 
surface S. Since E, is equal to zero over the grid surfaces, no contribution 
to the integral results from these surfaces. The power flowing in from the 
side surface is given by VI*, where V is the gap voltage and / the current. 
The second term on the left-hand side expresses the net kinetic power. 
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Since V k {- l) /*(-/) enters through the first grid and F t (0) J*(0) leaves 
through the second, the left-hand side of (8.20) becomes 

— {VI* + V k (- l) J* ( - l) — V k (0)J*(0)} =-{Vl*-MVJ*(- l)e je } 

(8.76) 

where the negative sign appears in front of the bracket because of the 
direction of n. 

The first and second terms on the right-hand side of (8.20) express the 
stored electric and magnetic energies, respectively. We neglect the second 
term because the inductance of the gap electrodes are usually small. The 
stored electric energy is calculated as follows, first, we note that the trans- 
verse component of the electric field is negligible since the gap is narrow 
and the tangential components of the field at the grid surfaces are zero. 
Second, the stored electric energy multiplied by jco is given by 



j(oe\V/l\ 2 IS =j(oCVV > 



(8.77) 



where lS g is the volume of the gap space, C is the gap capacitance given by 
sSgjl, and the z-component of the electric field is approximated by V/l. 

Finally, the third term on the right-hand side of (8.20) will be shown to be 
negligible. The integrand is 



-Qov(z) v*(z) = q 0 



fk(z)F fc *(z) l*o 



v 0 (m/e) 2 V 0 



-“I *ttl a r 



(8.78) 



where V 0 is the static voltage required to accelerate electrons to the velocity 
v 0 , and i 0 = g 0 v 0 . Integrating the right-hand side over the volume occupied 
by the electron beam in the gap and multiplying the result by jco, we have 



2 V 0 v 0 Z 0 



(8.79) 



where <F* 2 > is the average value of |F^(z)| 2 from z = — l to 0 and Z 0 is 
defined by (8.59). Usually <F fc 2 >/Z 0 is of the order of V k J and fijl < 1 > 
therefore the contribution from this term is negligibly small compared to 
the terms appearing in (8.76). 

Summarizing the above discussion, (8.20) becomes 

- VI* = - l ) e j0 + jcoCVV* 

Dividing by — V and taking the complex conjugate, we have 

I = M*J {— l ) e~ J0 + jcoCV = MJ (— /) +jo)CV (8.80) 



(8.80) 
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The first term on the right-hand side represents the current induced by the 
electron beam while the second term expresses the current into the gap 
capacitance. The sum of these two currents make up the net current flowing 
into the gap through the external circuit. 

The above discussion applies to an ideal gap with two parallel grids, in 
practice however, a gridless gap is often utilized to avoid the interception of 
electrons as shown in Fig. 8.4. In such a case, the electric field at the electron 





Fig. 8.4. Gridless electrode gap. 



beam is weaker than the gap voltage divided by the gap distance, and the 
beam coupling factor M becomes smaller than that given by (8.70). To take 
this effect into account, M given by (8.70) is reduced by the factor t] <1. 
Let us assume that the transit angle of the gap is very small so that 



sin (0/2) 



then (8.74), (8.75), and (8.80) reduce to 



- 1 



(8.81) 



M0)= V k (-l)-tiV 9 = i = - t]J (-i) + j^cv 

(8.82) 



An equivalent circuit representing (8.82) is shown in Fig. 8.5. Note that the 
purpose of this equivalent circuit is simply to represent the relations (8.82), 
and it is not to be used for connecting impedances which fix the ratio between 
V k and J at the ports corresponding to the electron beam. 

In the remainder of this section, we shall discuss the principle of the 
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Fig. 8.5. Equivalent circuit of an electrode gap. 



klystron as an example of utilizing (8.82). In its simplest form the klystron 
has two gaps placed one quarter of a plasma wavelength l q = 2nlf$ q apart. 
The first gap excites the space-charge waves while the second one receives 
power from the electron beam. Since the gap capacitance C is usually too 
large to apply the gap voltage or to extract power effectively, the capacitance 
effect is canceled by the inductance of a resonant cavity. Taking a proper 
reference plane, the equivalent circuit of the cavity including the gap should 
look like the one shown in Fig. 8.6, where G is the conductance representing 
the cavity losses. The parallel resonant circuit is dual to the series resonant 
circuit discussed in Section 4.3. In the following discussion, subscripts 1 and 
2 will be used to express the quantities related to the first and second gaps, 
respectively. 

In order to apply a voltage V x to the first gap, a net power G X \V i| 2 is 



v 




Fig. 8.6. Electrode gap with a resonant cavity and its equivalent circuit. 
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necessary. Since the available power P a from the generator times the power 
transmission coefficient gives the net power, we have 



Gil^il 



{4 G.GJiGy + G g ) 2 } 4G 



“ + G t) 



where the cavity is assumed to resonate at the signal frequency, and G g is 
the generator conductance as seen from the gap. If the electronic current J 
and kinetic voltage V k are both equal to zero at the entrance to the first gap, 
V k becomes -r\ x V v , and J remains zero when leaving the gap. Taking the 
origin of the z-axis at the exit of the first gap, we have from (8.56) and 
(8.58) 

J+ + J- = 0 , Z 0 (J + — J_) = — t] j Vj 



from which J+ and J_ are obtained: 



J + =-iZ o'my,. 1,V, (8.84) 



Thus, fast and slow waves having the same magnitude but opposite signs 
are excited by the first gap. As we move along the electron beam toward the 
second gap, from (8.62) and (8.63) the magnitudes of V k (z) and J(z ) change 
as shown in Fig. 8.7. At the entrance into the second gap z 2 = 2J4, r = — l, 
and 

IM*2)I“0, |J(z 2 )| = 2|/ + | (8.85) 




Fig. 8.7. Kinetic voltage and electronic current versus distance z. 
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Applying (8.82) to the second gap, we obtain 



K(z 2 + * 2 ) = - 72^2. J{?2 + * 2 ) = *J(z 2 )» l2 = - t hJ(z2)+ja>CV 2 

( 8 . 86 ) 

Let us assume that the capacitive current ja>CV 2 is canceled by the 
inductive current in the cavity, then the total current flowing through the 
external load G L plus the cavity loss conductance G 2 becomes t] 2 J(z 2 ), and 
the gap voltage V 2 is given by 

V, = (G 2 + Gjr l t ll J(z 2 ) (8.87) 

Using (8.84), (8.85), and (8.87), the actual power to the load is calculated 
to be 

Pl = G l W 2 \ 2 = G l (G 2 + G l )~ 2 H2 2 Zo"Vim 2 (8.88) 
The transducer gain is given by 

(PJPJ = -!i 2 72 2 Zo _2 4 G,G l (G 1 + G„)- 2 (C 2 + GJ" 2 (8.89) 
When G, = G g and G 2 = G L , the maximum gain is obtained: 
f^) = 7iVZo 2 (4G 1 G 2 )- 1 

The kinetic power at the entrance to the second gap is equal to zero since 
the kinetic voltage is zero. From (8.86) and (8.87), the outgoing kinetic 
power from the exit is given by 

- rt 2 V 2 J*{z 2 ) = - (G 2 + GJ" 1 r, 2 2 \J(z 2 )\ 2 

which indicates that the electron beam receives this amount of negative 
kinetic power from the gap. On the other hand, from (8.87) the total power 
into the load G L and G 2 is given by 

(G 2 + g l ) \ v 2 \ 2 = (G 2 + g l )-' 1 , 2 2 \J(z 2 )\ 2 

As expected, the power the electron beam receives plus the power to G L 
and G 2 is equal to zero. In other words, the gap receives power from the 
electron beam and delivers it to G L and G 2 . 

In order to increase the maximum gain, G x and G 2 must be decreased 
which is obvious since G t and G 2 express the cavity losses. On the other 
hand, some explanation may be necessary as to why G g and G t have to be 
decreased in order to increase the transducer gain when G x and G 2 can be 
neglected. Since the electronic current J is equal to zero at the exit from the 
first gap, no kinetic power is added to the electron beam by the gap. This 




means that no power is required to excite the space-charge waves in the 
first gap although a large gap voltage is necessary for a large gain. To 
increase the gap voltage V u keeping P a constant, G g has to be decreased. 
For the second gap, the active component of the current / 2 is solely deter- 
mined by the electronic current J(z 2 ). In order to increase the output power 
under the constant current condition, the voltage V 2 has to be increased by 
decreasing G L . Hence, G g and G L must be small for a large gain. 

Summarizing the above discussion, a klystron amplifies a signal because 
a negligible amount of power is required to excite space-charge waves in a 
narrow gap while a second gap one quarter of a plasma wavelength away 
provides a constant current source from which useful power can be extracted. 



8.4 Continuous Interaction with a Slow Wave Structure 

It is possible to achieve amplification utilizing the continuous interaction 
between an electron beam and an electromagnetic wave on a slow wave 
structure when their velocities are nearly equal. An electron tube based on 
this principle is called a traveling wave tube. The slow wave structure for a 
traveling wave tube is often realized by winding a thin wire into a helix with 
its diameter sufficiently small compared to a wavelength in free space. The 
electromagnetic wave propagating along the helix tends to follow the wound 
wire, and its axial phase velocity can be made approximately equal to the 
electron velocity when the pitch of the helix is properly chosen. In addition 
to helices, many periodic structures of the type discussed in Sec. 6.4 can 
be utilized as slow wave structures in traveling wave tubes. 

Let a x (z) represent the electromagnetic wave propagating along the slow 
wave structure in the positive z direction. By suitable normalization, \a x (z)| 2 
is made to express the transmission power. The interaction between the 
electron beam and the slow wave structure takes place through the 
z-component of the electric field with a proper phase constant. Since £. is 
90° out of phase from E f , as (3.61) indicates, if the phase of a x {z) is chosen 
to be in phase with the corresponding E r , the z-component of the electric 
field at the electron beam in the slow wave structure becomes 

E z e~ jls * 2 =jP s Z l k /2 ai (z) (8.90) 

where /? s represents the phase constant of the proper field component (i.e., 

& - and z l 12 's a proportional constant. Note that Z k has the dimen- 
sion of impedance. 
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Let us next express the space-charge waves, along the electron beam by 
a 2 (z) = Z i 0 l2 J + e li/ ’- f ^, (8.91) 



then the transmission powers in the positive z direction due to the space- 
charge waves are given by |a 2 ( z )| 2 and — |« 3 (z^)| 2 , respectively, since the 
electromagnetic power of each wave is negligible compared to its kinetic 
power. 

Assuming a small coupling between the electron beam and the slow wave 
structure, we now apply the theory of coupled modes described in Section 
6.1. To do so, however, we must obtain the coupling coefficients C 2l and 
C 31 . Suppose that a 2 (z) and a 3 (z ) are zero at z = 0; let us investigate their 
values at z = Az for a given ^(0). From (8.90), E z is approximately given 
by jfi s Z]J 2 a t (0) over the narrow region from z = 0 to Az. If we set / = Az 
and V = — E z Az — — jfl s Z* l/2 ^(0) Az, then the problem reduces to that 
of the gap interaction discussed in Section 8.3. In the present case, Az = l 
is an infinitesimal, and the beam coupling factor M becomes — 1. Thus, 
the kinetic voltage at z = Az is given by 



V t (Az) = -V= jp, Z' t n a , (0) Az (8.92) 



The electronic current remains the same: 



J (Az) = 0 (8.93) 

Combining (8.91), (8.92), and (8.93), we have 

a 2 (Az)-a 3 (Az)=jP s (Z k IZ o ) 1,2 a 1 (0)Az, a 2 (Az) + a 3 (Az) = 0 (8.94) 

from which a 2 (Az) and a 3 (Az ) are calculated to be 

a 2 (Az) = ii/UZ./Zo)*' 2 (0) Az (8.95) 

a, (Az) = - ijp,(Z k IZ o) 1 ' 2 a, (0) Az (8.96) 



Comparing these equations with (6.4) and remembering that a 2 (0) = a 3 (0) 
= 0, C 21 and C 31 are found to be 

c 21 =-C* 2 = - ijf} s (Z k IZ 0 ) 112 , C 31 = C* 13 = i j(t s (ZjZ 0 ) 112 (8.97) 

The coupling coefficient C 23 between the fast and slow space-charge waves 
will be neglected since it becomes a higher order infinitesimal when we 
impose the condtion of small coupling between the electron beam and the 
slow wave structure. 

For simplicity, let us first consider the interaction between two waves. 
When a v (z) interacts with a 2 (z), the two waves exchange transmission power 
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back and forth as we discussed in connection with Fig. 6.3. On the other 
hand, if a 1 (z) and a 3 (z) have similar phase velocities and interact with each 
other strongly, growing waves will result as we discussed in connection with 
Fig. 6.4. Let 

Po + Afi o = p s , p 0 - A{ % = P q + p e (8.98) 

and assume that a l (z) = A 0 and a 3 (z) = 0 at z = 0, the input of the coupling 
region, then 

|a, (z) | 2 = A 0 2 (1 + (|C 13 |/a) 2 sinh 2 az} (8.99) 

- \ a s( 2 )\ 2 — ~ d 0 2 (\C l3 \ja) 2 sinh 2 az (8.100) 

where 

* = {\C i3 \ 2 -(Ap Q ) 2 Y 12 (8.101) 

and \C 13 \ 2 > (Ap 0 ) 2 is assumed. If the two waves are separated at z = L 
and the electromagnetic wave along the slow wave structure is fed into a 
matched load, the transducer gain is given by 

l«i(T)| 2 /[fl I (0)| 2 = 1 + ([C 13 |/a) 2 sinh 2 aL (8.102) 

provided that the input is also matched. This explains the amplification 
mechanism of traveling wave tubes. 

In practice, and |C 13 | often have the same order of magnitude, in such 
a case the interactions among three waves, a x (z), a 2 (z), and a 3 (z) must be 
considered. The eigenvalue problem for the system is given by 





‘ JPs 


^13 ^13 






^13 


j(Pe~P q ) 0 x = yx 


(8.103) 




L c 13 


0 j(Pe + P yj 




The eigenvalues are obtained from the solutions of the third order algebraic 


equation 


iP s - y 


- cL c 13 I 






^13 
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■a 
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II 

O 


(8.104) 




c* 

C 13 


0 j(Pe + P q )-y\ 




If we set 




y=jPe + 8 


(8.105) 


Eq. (8.104) becomes 








(A 2 + P 2 ){- i + j (ft - ft)} + 2/ft | C, ,| 2 = 0 


(8.106) 



Let a and y be the real and imaginary parts of <5: 



8 = a + jy 



(8.107) 
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then (8.106) can be decomposed into real and imaginary parts 

- «[a 2 - / + P* -2 y (y - (fi, - &)}] = 0 
(~y + P.~ W(« 2 - / + ft 2 ) - 20 + 2 ft|C, 3 | 2 = 0 



as follows: 
(8.108) 



By inspection, there are solutions satisfying 



« = 0, P,-P. = 2f} q \C 13 \ 2 (y 2 - p 2 )' 1 + y 



(8.109) 



The waves corresponding to these solutions do not grow or decay with z 
since the real part of y is equal to zero. If we assume nonzero a (8.108) gives 

a = + [2 :y{y- (A - ft)} + y 2 - ft 2 ] 1 ' 2 (8.1 10) 

ft - A = 2 y± {ft 2 + ft ic 13 | 2 y - ‘} ,,J (8.111) 



Suppose that the value of /y|C 13 | is fixed, then one value of fi s — fi e can be 
calculated from (8.109) for each value of y. Similarly, (8.1 1 1) gives the other 
values of & - /? e , and (8.110) gives the corresponding a, provided that a 
and fl s - p e , thus obtained, are all real. In this way, we can obtain all the 
possible values of a and for each y from which we can reconstruct 

the propagation constant y as a multivalued function of fi s — ft e . As an 
example, the result of the calculation for a particular case in which = 
yj2\C t3 \ is shown in Fig. 8.8. To facilitate a direct comparison with Fig. 6.2, 
the real and imaginary components of y/|C l3 | are shown as functions of 
AMC n \. The interaction between a t (z) and a 2 (z) gives Fig. 6.2(a), and 
that between a r (z) and a 3 {z) gives Fig. 6.2(b), however, when a x (z) inter- 
acts with a 2 (z) and a 3 (z), as in the present case, the two patterns are inter- 
mingled and distorted as illustrated in Fig. 8.8. 

Once the eigenvalues are obtained, the corresponding eigenvectors can 
easily be calculated from (8.103): 



x. = 



c 13 /(-<w/y 
cun-*t+n yJ 



(i = 1, 2, 3) 



( 8 . 112 ) 



Note that three eigenvectors are obtained corresponding to three different 
eigenvalues. Suppose that Rey ( - = a f <0 and that x x represents the corre- 
sponding growing wave, then the amplitude of Xj increases with z while the 
other waves either decay or propagate unchanged, hence the Xj component 
outweighs the others at the output port provided that the coupling region 
is sufficiently long. 

Under the conditions of the previous paragraph the transducer gain can 
be calculated as follows: We assume that a 1 (z) = A 0 at the input z = 0. 






Fig. 8.8. P — po and a versus Apa ~ ±(p s - p e — p q ). 



then the x l component at z = 0 is given by 



where 



*i 



x,P 










3V + /v - 2«5J(£ 



- M t 1 



-Cta -C 13 - 

~)h- 



At the output where z = L, the magnitude of the x x component becomes 
exp(— a t L) times as large, and hence the transducer gain becomes 



ai (L) 


2 


&i 2 + P q 2 






W + p, 2 -2 ft,(ft-ft) 



(8.113) 
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Let us assume that the fast and slow waves are given by A t and A 2 , 
respectively, at a cross section a of the electron beam shown in Fig. 8.9, 
and that they become B 1 and B 2 at another cross section b situated beyond 
a lossless circuit. Let a and b be defined by 




(8.114) 




Fig. 8.9. Lossless transformation of space charge waves. 



Under the small signal assumption, the system is linear and b must be express- 
ible as a square matrix K of order 2 times a: 

b = Ka (8.115) 

The total transmission power in the positive z direction at a is given by 

\A l \ 2 — \A 2 \ 2 — a + Pa (8.116) 

where 

P = diag[l -1] (8.117) 

The negative sign in front of \A 2 \ 2 comes from the fact that the slow wave 
carries a negative power. The total transmission power at b is given by 
b + Pb, and since the system is lossless, these two powers must be equal, i.e., 

a + Pa — b + Pb = 0 

or equivalently, 

a + (P — K + PK) a = 0 (8.118) 

Since a is arbitrary, we have 

K + PK = P (8.119) 

which expresses the lossless condition for K. Taking the determinant of 
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( . 19) we see that det K is not equal to zero, and hence K -1 exists 
n order to discuss electron beam noise, let us consider the noise power 
<> mdicates tha ensemble average. For example, 
£ V’ f 15 " 1 1 com P° nent <aa + >, expresses the noise power 

the f3St ™ ve ln a narrow bandwidth B while the 1 2 component <A.A »> 
expresses the correlation between the noise components in the fast and 
slow waves. At the cross section b, the noise power matrix becomes <bb + > 
and because of (8.115), it can be written in the form 

<bb + > = K <aa + > K + (8 . 120) 

Multiplying (8.120) by P from the right and using (8.119), we have 

<bb + > P = K<aa + > K + P = K<aa'> PK‘ (8.121) 

Note that <aa + > P undergoes the similarity transformation by K when the 
electron beam passes through the lossless circuit. Since the trace and deter- 

d“ through' 1 ™ *" inVariam t0 3 Simil3rity tranSf — ’ ", aad «. 

tr«aa + > P) = <|A 1 | 2 > — <|A 2 | 2 ) = n p « 122) 

det«aa + > P) = < Ml A 2 T> _ <Mj|I> <M/> „ i(< _ ^ ^ 

are invariant. 

r ShaH f ^ CUSS thG measurement of the noise parameters ». and » 
n terms of A x and A 2 , the electron beam current is given by 

J ( z ) = (Z 0 )~ 1/2 {A ie ^ z + A 2 e~ JPqZ } e~ ifs * z ( 8 . 124 ) 

from which we have 

<1^ (a)| 2 > = 2." KMrl 2 ) + <M 2 | 2 > + 2<M,Vl> cos(2fi q z + q,)} 

where tp is defined through (8 ' l25) 

,an V = {ta«-4iV»/Re«/M 2 *» (8.126) 

in fLTiw f t P h C ° UpIed resonant cavit > al0 ”g ^ beam, as shown 

' 8 -’° (a)> the no,se out P ut the cavity is proportional to <|./(z)| 2 > 

I e, r n beam d,rect currem is red - ed *° a 

f,^l‘ e V r aVlty ° UtPUt COrres P° nds to the f“H shot noise 2eBJ 0 = 

vleo/f f avity outpul using this shot noise - the abso,u * 
alue of <|/(z)| > under the operating condition can be obtained from the 

above measurement. 
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<|J| 2 > 




(a) (b) 



Fig. 8.10. Measurement of noise parameters n s and n p . 




The product of the maximum and minimum values of <|/(z)| 2 > is cal- 
culated from (8.125) to be 

<|/| 2 > min = Zo 2 {«Mil 2 > + <M 2 I 2 )) 2 - 4<M^ 2 *| 2 )} 

(8.127) 

From (8.122) and (8.123) the terms inside the brackets on the right-hand 
side are equal to n 2 , so that 



« s 2 = Z 0 2 <[J| 2 > max <|J| 2 > r 



(8.128) 



The impedance Z 0 can be calculated from V 0 , J 0 , [i q , and p e , and the value 
of <|/| 2 ) max <M 2 ) min , is obtainable from the measurement of <|7(z)| 2 ). 
Thus, n s can be determined from (8.128). 

To obtain « p , let us next take the ratio of the maximum and minimum 
values of <[/(z)| 2 >: 



2 _ (M 2 )™* = <M,I 2 > + <M2l_ 2 M L 2<M^ 2 *|> 
Q <|i| 2 > mfa <M.I 2 > + <M 2 | 2 > - 2 <i4VI> 



(8.129) 



If there is no correlation between A l and A 2 , Q 2 = 1, and the cavity output 
remains the same regardless of its position along the beam. Only when A x 
and A 2 are correlated, <|y(z)| 2 > changes as shown in Fig. 8.10(b). From 
(8.122) and (8.123), n p /n s can be expressed in terms of q, (Mil 2 ) and 
<|T 2 | 2 > as follows: 

w P _ g 2 + i <Mil 2 >-<M 2 | 2 > (8130) 

», 2 q <M 1 | 2 > + <M 2 [ 2 > 



This shows that n can be calculated from the previously measured n s and 
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q if the relative magnitudes of <|^4 1 | 2 ) and <(T 2 | 2 > are obtained. The terms 
(Mil 2 ) and — (M2I 2 ) express the fast and slow wave noise powers, 
respectively. Suppose that two cavities with each gap coupled to the electron 
beam are separated by a certain distance and that their outputs are combined 
through appropriate phase shifters, then the resultant output can be made 
to couple either with the fast wave only or with the slow wave; in this way 
the relative magnitudes of \A X \ 2 and \A 2 \ 2 can be obtained. Thus, both n s 
and n p are measurable with movable cavities coupled to the electron beam. 

Let us finally consider the best amplifier noise performance obtainable 
from an electron beam with given n s and n p . To do so, we first try to obtain 
a canonical form of the noisy electron beam. The matrix <aa + > is self- 
adjoint and can be either positive-definite or positive-semidefinite; we shall 
restrict ourselves to the case in which <aa + > is positive-definite, then we 
have two self-adjoint matrices P and <aa + > one of which is positive-definite. 
From the discussion leading to (5.60) and (5.61), they can be simultaneously 
diagonalized by a matrix H as follows. 

H + <aa + > H = I (8.131) 

H+PH =[o "J ( 8 - 132 > 

where both 2, and l 2 are real. Taking the determinant of the left-hand side 
in (8.132), we obtain 

det H + det P det H = det P |det H| 2 

which is negative. Consequently, the determinant of the right-hand side, 
V 2 , must be negative, in other words, and X 2 have opposite signs. If the 
first diagonal component of (8.132) happens to be negative for a certain H, 
use HL in place of H, where L is given by 



then we have 




LH + <aa + > HL = LIL = I 



LH + PHL = L 




and the first component becomes positive. Without loss of generality, we 
can therefore assume that X x is positive in (8.132) and that X 2 is negative. 
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Let r be defined by 




r ru,r 1/2 o i 
L o (-a 2 )-‘' 2 J 


(8.133) 


then (8.132) can be written in the form 




H + PH = r 1 pr 1 




which is equivalent to 




TH + PHr = P 


(8.134) 


On the other hand, from (8.131) we have 




rm<a^>HT=[f _° ] 


(8.135) 


If we set 




K = TH + 


(8.136) 


then (8.134) and (8.135) become 




KPK + = P 


(8.137) 


k<»» + >k + = [T .%] 


(8.138) 


Following the derivation of (7.31), (8.1 19) shows that K satisfying (8.137) 
represents a lossless transformation. Multiplying (8.138) by P from the 


right and comparing the result with (8.121), we have 




tr ([T -°v] p 


(8.139) 


det ([ n — ‘] P ) = ^ 1 1 = ^ ^ np2 


- */) (8.140) 



where use is made of the fact that the trace and determinant of <aa + > P 
are invariant to lossless transformations. From (8.139) and (8.140), we 
obtain 

V = i (« s + n P ) > “ K 1 = i(», “ n P ) 

Let <a'a' + > be the noise power matrix of the electron beam after the 
lossless transformation by K, then we have 

< aV+ >=[ i( V" p) ifc.J (8141) 

The matrix K transforms a to a' and K _1 transforms a' back to a. Note that 
K _1 also represents a lossless transformation since (K~ 1 ) + PK _1 =P from 
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(8.119). We conclude from the above discussion that an electron beam 
with <aa + > can be considered as the result of the lossless transforamtion 
K -1 applied to a similar electron beam with <a'a , + >. The electron beam 
with <a'a' + ) corresponds to the canonical form of linear noisy networks 
discussed in Section 7.1, and K corresponds to the canonical transformation. 

We are now in a position to calculate the optimum noise measure of the 
amplifier shown in Fig. 8.11. The power waves at the input are given by a v 




l ° 3| 2 |° 3 | 2 | b 3 | 2 

-M 2 -Kl 2 -|b 4 | 2 



Fig. 8.1 1 . An electron beam amplifier. 



and b u and at the output by a 2 and b 2 . The generator and load are assumed 
to have impedances with positive real parts. Let a 3 and a 4 be the noise 
components in the fast and slow waves at the entrance to the coupling 
region, and let b 3 and b 4 be those at the exit. Since a 3 and a 4 can be trans- 
formed by a lossless circuit to uncorrelated waves a 3 and a 4 \ as we dis- 
cussed above, we can use a 3 and a 4 instead of a 3 and a 4 to investigate the 
effect of the most general lossless network. 

Let us define a scattering matrix S through 
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then the gain and noise measure of the amplifier are given by 

C = |S 21 | 2 (8.143) 

„ | S 22 | J | a; | J + | S 23 | 3 | a3 '| 2 + | S 24 | 2 | I34 '| 2 /0(JJ> 

M = (|s 2 ,| 2 - 1) |o,l" (8 - 144) 

where 

\a i \ 2 = kT i B, \a 2 \ 2 = kT L B , \a 3 '\ 2 = $(n s + n p ) t \a 4 '\ 2 = \ (n s - n p ) 

(8.145) 

Since S represents a lossless network, it satisfies 

S + PS = P 

where P = diag[l 11 —1]. From this, following the derivation of (7.32), 
we obtain 

| S 2 1 | 2 + | S 22 | 2 + | S 23 | 2 -| S 24 | 2 = l 



Substituting this relation into (8.144), we have 

| S 22 | 2 | a 2 | 2 + | S 23 | 2 | fl 3 '| 2 + | S 24 | 2 | o 4 '| 2 

JyJ =: 

(-| S 22 | 2 -| S 23 | 2 + | S 24 | 2 )| a ,| 2 



from which the optimum value of M is calculated to be |u , 4 '| 2 /|a 1 | 2 , or 
equivalently 



M opt = {(«, - n^lkTfi) (8.147) 



where use is made of (8. 145). An argument similar to the one given in Section 
7.2 shows that it is impossible to achieve a positive M smaller than M opt 
with any passive network. 

In the above discussion, we used a small signal assumption and considered 
only one pair of space-charge waves. In practice, the small signal assumption 
will not hold in the vicinity of the cathode where v 0 is small, hence there is a 
possibility of improving the optimum noise measure of an electron beam 
by modifying the potential profile near the cathode. Consequently, a major 
effort has been concentrated on the study of cathodes and their vicinity in 
order to obtain low noise electron beam amplifiers. 



PROBLEMS 

8.1 Calculate the plasma frequency of an electron beam with diameter 1 mm, direct 
current Jo — 10 mA, and accelerating voltage Vo — 1000 V. 

8.2 Prove that the ratio between the electromagnetic and kinetic powers of a uniform 
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electron beam filling the inside of a waveguide is given by 

P =b OJq 0. >p 2 — 03q 2 
Pk 03 =F 03 q (Op 2 

where the upper signs apply to the fast wave and the lower signs to the slow wave. 

8.3 Suppose that one additional cavity is gap-coupled to the electron beam midway 
between the input and output cavities of a klystron; calculate the gain as a function 
of the distance between adjacent cavities. 

8.4 Solve the eigenvalue problem (8.35) for a uniform electron beam with radius a 
located coaxially inside a circular waveguide with radius b (b > a). Assume rotational 
symmetry of the modes to simplify the calculation. 

(Hint: The propagation constant y of each mode can be obtained from (8.43) 
with k n satisfying 

k n Jo(k n a ) _ M/o'OM Ko(peb) - K 0 '(M) to (M)} 

Jo(k„a) Io(Ped) Ko(fieb) — Ko(fi f a) Io(fieb) 

where lo(fier) and Ko(P e r) are hyperbolic Bessel functions and use has been made of 
(o 2 e/i + y 2 ~ — p e 2 . The hyperbolic Bessel functions are two independent solutions 
of the differential equation ( d 2 u/dr 2 ) + (l/r) (clu/dr) — p e 2 u — 0.) 



8.5 Calculate the gain of a traveling wave tube without assuming the predominance of 
the growing wave at the output end of the coupling region. 
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CHAPTER 9 

OSCILLATORS 



Oscillators convert dc energy into rf. They differ from amplifiers in that 
the presence of an input signal is not essential for their operation while 
their output frequency and amplitude are primarily determined by their 
circuit characteristics. On the other hand, an amplifier produces its output 
signal only when the input signal is present. 

Oscillators are inherently nonlinear and the simplest model has to take 
this effect into account; otherwise, the model will not reach a stable operating 
point. Keeping this in mind, we shall first construct a simple oscillator 
model and study its behavior, including the synchronization by a small 
signal injection. Then, we shall discuss the noise of free-running as well as 
synchronized oscillators. Since the concept of generalized functions and 
their Fourier analysis will be used extensively in the noise discussion, it is 
recommended for those who are not familiar with generalized functions 
that they read Appendix II before beginning Section 9.3. 



9.1 A Simplified Oscillator Model 

Let us consider a solid state oscillator which consists of a cavity containing 
an active device and a waveguide connecting the cavity and the load, as 
shown in Fig. 9.1. Since the active part of the device is generally very small 
compared to a wavelength in free space, the conventional voltage and current 
will be well defined at the terminals of the active part at the fundamental 
frequency of oscillation. 

At each harmonic frequency, the device sees a certain environment, say 
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CAVITY WAVEGUIDE LOAD 




Fig. 9.1. A solid state oscillator. 



impedance, which is not affected by the load condition if the harmonic 
content in the waveguide at the output port is small. Under this condition, 
the active part of the device at the fundamental frequency exhibits a certain 
nonlinear admittance which, in general, varies slowly with frequency 
(although this may not be true if anyone of the harmonics is in a strong 
resonance). Let us consider a two-port resonant cavity consisting of the 
region between an appropriate reference plane in the output waveguide of 
the oscillator and the terminals of the active part of the device. Suppose 
that only one resonant mode predominates, then the equivalent circuit of 
the cavity must be similar to that shown in Fig. 4.10. A certain length of 
transmission line may have to be inserted in the equivalent circuit before 
connecting it to the nonlinear device admittance since a shift of reference 
plane is not allowed on the device side of the equivalent circuit. This means 
that the nonlinear device admittance undergoes a transformation by the 
same transmission line before it is connected to the equivalent circuit in 
Fig. 4.10. Let us express the transformed nonlinear admittance in the form 
of a nonlinear impedance. Notice that the nonlinear impedance also varies 
slowly with frequency. The equivalent circuit of the oscillator, including 
the load resistance R 0 , then becomes a series connection of an inductance, 
a capacitance, a small positive resistance R t representing the cavity loss, 
the nonlinear device impedance - R + jX, and R 0 as shown in Fig. 9.2, 
A small voltage source e(t) represents the noise in the device and the circuit 
and/or the effect of a synchronizing signal, which we shall study later. 
The equation governing the behavior of the oscillator is given by 

di , 1 f 

L - dt + {Ri + Ro) l + c \id t + v = e(t) (9.1) 

where v is the voltage developed across the device impedance — R + jX. 
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-R + jX> V 




Fig. 9.2. Equivalent circuit of an oscillator. 



When e(t) is equal to zero, the current is expected to be approximately 
sinusoidal; 

i (f) = A cos (cut + (p) (9.2) 

where harmonic currents are neglected. The voltage drop across the device 
impedance can be expressed as 

v = - RA cos (cut + (p)- XA sin (cut + <p) (9.3) 

provided that the frequency dependences of R and X are negligible. (A 
linear part of the device reactance can be included in the LC circuit of the 
cavity, thereby relaxing this restriction to a certain extent.) Now suppose 
that a small perturbation is given to the system by applying a small e{t ), 
then i(t) may no longer be sinusoidal. In many cases, however, i(t) is 
similar to the original waveform. We shall restrict ourselves to these cases 
only, and assume that /(/) can be represented by 

/ (/) = A ( t ) cos {cur + <p (r)} (9.4) 

where A (r) and (p{t) do not change appreciably over one cycle of the oscil- 
lation (= Injoi). Under this condition, dijdt and J / dt are given by 

di ( d(p\ t . dA 

— = — A { cu + — sin (cur + <p) + cos(cut - 1 - <p) (9.5) 

dt \dt dt 



( A A dtp 

i dt = ( = — 

\cu o y dt 



J sin (cur + (p) H — j — cos (cur + <p) (9.6) 

J cu dt 



where use is made of integration by parts neglecting the higher order terms 
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of (\ 1(d). Equation (9.3) is still valid provided that A and (p are now interpreted 
as functions of time. Substituting Equations (9.3) through (9.6) into (9.1), 
multiplying by cos (cor + <p) or sin (cur + <p), and integrating with respect to 
time from t-T 0 to r, where T 0 is a period of the oscillation (=2tt/cu), we 
obtain approximate differential equations in A and cp: 



1 \dA 2 

L + a?c)dl + ^ + R <>~V A = ^ 



ToJ 



e(r) cos (cor + (p ) dt 



i \d(p 2 r 

> 2 c) dt~AT „],- To Sin ^“' + ^ dt 



where the orthogonality relation between since and cosine functions has 
been used. 

Even though higher harmonics at the output port are assumed to be 
small, since e(t) is also small, their effect may be of the same order of 
magnitude in the original Eq. (9.1). However, the above integrations with 
respect to time, eliminate from our discussion the possible effect of higher 
harmonics because of the orthogonality relations between the fundamental 
and harmonic components. 

Having obtained the differential equations which govern the behavior of 
A and < p, let us examine the free-running oscillation in these terms. For 
steady state free-running oscillation, dA/dt = 0 and e(r) = 0. Consequently, 
(9.7) gives 

Rj + R 0 = R (9.9) 



In general, R is a function of A and may appear similar to Fig. 9.3(a). 
From (9.9), the amplitude of the current is determined as the intersection of 
R and R t + R 0 . Let the amplitude be A 0 . If sR 0 represents A 0 times — dRjdA 
at A 0 , as shown in Fig. 9.3(a), then, for a small variation A A from A 0 , we 
have 

Ri + R 0 ~ R = sR 0 (AAjA 0 ) (9.10) 



In the vicinity of A 0 , Eq. (9.7) becomes 

(r 1 \ dAA 

w +sR ° AA=0 

Suppose AA is not equal to zero at r = 0. If 5>0, then \AA\ decreases 
exponentially with time, and A 0 represents a stable operating point. On the 
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Fig. 9.3. (a) R as a function of A; (b) X as a function of A. 

other hand, if R intersects R t + R 0 such that s < 0, then \AA\ grows indefinitely 
with time. This means that such a point does not provide stable operation. 

The steady state oscillation frequency is determined from (9.8) together 
with dcpjdt = 0 and e(t) = 0; 

coL + JP=(l/o>C) 

Assuming that X is small, the oscillation frequency is calculated to be 

cOo=o> 0 {l-W;J(XiR 0 )} ( 9 . 11 ) 

where co 0 and Q ext are given by 

cu 0 = ULC )- 1/2 , Q ext = (g> 0 LIR 0 ) (9.12) 




When X depends on A, as illustrated in Fig. 9.3(b), the value of Jfat A = A 0 
must be used in (9.1 1) to obtain co 0 '. 

fn the above discussion, a small parallel conductance G has been neglected 
which represents the losses of other resonant modes and remains uncanceled 
after a proper shift of the reference plane at the output port. A similar 
conductance on the device side of the equivalent circuit can be considered 
to be included in the device impedance. 

A plot of constant power and constant frequency contours on a Smith 
chart of the load impedance of an oscillator is called a Rieke diagram. 
When the load impedance has a reactive part X 0 , in (9.11) X has to be 




Fig. 9.4. Rieke diagram of an ideal oscillator (a) G = 0; (b) G ^ 0. The usual 
Smith chart constant resistance and reactance contours have been omitted. 
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replaced by X + X 0 . From this modified equation and (9.9), the Rieke 
diagram of our oscillator model has the appearance that is shown in Fig. 
9.4(a). As the magnitude of the load impedance increases, the effect of the 
conductance G mentioned above is no longer negligible, and the constant 
power contours with nonzero G will be modified, as illustrated in Fig. 9.4(b). 
Although the constant power and constant frequency contours intersect 
each other perpendicularly in Fig. 9.4(a), their intersections will make 
oblique angles if X is nonlinear (Problem 9.2). Furthermore, if more than 
one resonant mode is strongly excited in the cavity, the Rieke diagram will 
not resemble the one in Fig. 9.4(a). How well our model represents a given 
oscillator in the vicinity of the operating point can easily be checked by 
measuring the power and frequency as functions of load impedance and 
drawing an appropriate Rieke diagram for the oscillator. 



9.2 Synchronization by Injection Locking 

In this section, we shall study the injection locking mechanism of oscil- 
lators. For simplicity, let us assume that R 0 is equal to the waveguide 
characteristic impedance Z 0 of the output port. Furthermore, let e(t) in 
(9.7) and (9.8) represent the injection signal voltage a 0 cos o) s t, where a 0 is 
assumed to be small. When the oscillator is synchronized, the oscillating 
frequency must be the same as co s . The right-hand side of (9.7) is given by 

2Tq 1 a 0 cos coj cos(oj s t + <p) dt = a 0 cos (p 

J t-T 0 



Thus, (9.7) becomes 



dAA 

2 L V sR 0 A A = a 0 cos (p 

dt 



(9.13) 



where A A = A - A 0 , and use is made of LA (l/co s 2 C) ~ L + (l/m 0 2 C) = 2 L. 

When the oscillation is in the steady state, dAAjdt = 0, and (9.13) gives 

AA q = (a 0 cos (Po/sRq ) (9.14) 

On the other hand, the right-hand side of (9.8) is given by 

2(AT 0 )~ 1 f a Q cos (D s t sin (c o s t + <p) dt = (a 0 M) sin (p J 

J t-T 0 f 

Strictly speaking, A may not be the same as A 0 ; however, since a 0 F small, |' 
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the difference between A and A 0 can be neglected. Thus, (9.8) becomes 

('“‘ L+ i-*)- 2L rf? = 2 s ‘ n ' ? ’ (9 - ,5) 

For the steady state, dipfdt = 0 and <p must be a constant. Let this constant 
be <p 0 and let co s be co 0 ' + Aco 0 , where Aco 0 is the difference between the 
synchronizing and free-running frequencies. To a first order approximation, 
(9.15) now becomes 

- A(o 0 2L = (a 0 /A 0 ) sin<p 0 + rR 0 (AA 0 fA 0 ) 

where rR 0 is A 0 times dXjdA at A 0 , as shown in Fig. 9.3(b). Substituting 
(9.14) we have 

Aoj 0 = -iL~ 1 (a 0 /A 0 ) (1 + (r/.s) 2 ) 1/2 sin (<p 0 + 9) (9.16) 

where 

9 = tan (r Is) (9.17) 



Since sin(<p 0 + 0 ) must be less than 1, the maximum value of \Aco 0 \ for syn- 
chronization is given by 

M^oLx = 1 (a 0 /A 0 ) (1 + (r/s) 2 } 112 (9.18) 

Using this value of |zfm 0 | max , Eq. (9.16) becomes 

sin((p 0 + 0) = - (Ao) 0 l\Ao) 0 \ max ) (9.19) 

In the vicinity of A 0 and <p 0 , writing A = A 0 + AA 0 + AA and <p = (p 0 + A(p, 
Eqs.(9.13) and (9.15) reduce to 

dAA 

2 L + sR 0 A A = - a 0 A(p sin q> 0 (9.20) 

dA(p AA a 0 

~ 2L T/T rR ° = -rA(pcos<p 0 (9.21) 

ill yT 0 A 0 

Eliminating AA from these equations, we obtain 




For stable operation, \A(p\ must decrease with time if Aip is not equal to 
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zero at t = 0. Thus, we have two conditions 

sR 0 + (a 0 fA 0 ) cos<p 0 > 0, sR 0 (a 0 /A 0 ) {1 + (rfs) 2 } 1 ' 2 cos (<p 0 + 9)> 0 

The first condition is usually satisfied when a 0 is small and sR 0 is positive as 
required for stable free-running oscillation. The second condition is then 
equivalent to 

cos (cp 0 + 6) > 0 (9.23) 

For a given Aco 0 = to s — w 0 ', <p 0 is uniquely determined from (9.19) and 
(9.23). 

The above discussions were in terms of voltage and current. In the micro- 
wave circuit, power is a more meaningful quantity; therefore, let us express 
the synchronizing range in terms of power. The available power from the 
synchronizing source is given by 

P, = (a 0 /,/ 2) 2 /4R 0 = fl0 2 /8R 0 

while the output power of the free-running oscillator is given by 
P 0 = RoiAolJl) 2 = \R 0 A 0 2 
Substituting these into (9.18), we have 



-£K)7 



(9.24) 



where the external Q is defined by (9.12). 

The injection of a synchronizing signal and the extraction of output 
power from an oscillator are generally done through a circulator matched 
to the waveguide characteristic impedance. In this case, the output power 
is not given by R 0 A 0 2 / 2 but is given by the power emerging from the oscil- 
lator through the circulator, i.e., the actual power to R 0 . From the discussion 
in Section 1.4, the output power becomes 

P = iRo' \ V - R 0 I\ 2 

where V is the voltage across the terminals of R 0 (including e(t)) given by 
V — E — R 0 I = (« 0 /V 2) - R 0 {(A 0 + AA 0 )!J 2} e*> 

Thus, to a first order approximation, we have 

P = P 0 + A 0 a 0 cos<p 0 ^ = P 0 jl + 2 ^ - lj cos<p 0 j 

(9.25) 
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When combined with (9.19) this equation tells us how P changes with 
Aco 0 = co s -a) 0 '. If the circuit is adjusted so that the maximum output is 
obtained under the free-running condition, one of the constant output power 
contours (jR 0 A 2 = const) plotted on the plane of Fig. 9.3(a) should be 
tangent to the curve R at A 0 . Since the constant output power contours are 
represented by 

}R 0 A 2 = i (R — Rj) A 2 = const 
the maximum power condition is given by 

(dRfdA) A 2 + 2 A (R - R,) = 0 

or equivalently 

(dRfdA) = - (2R 0 /A) 

Remembering that 5 is defined as - A 0 fR 0 times dRfdA at the operating point, 
the above result shows that when the circuit is adjusted for maximum power, 

s 1S equal t0 two ; and that the variation in the output power of our model 
due to the injection of synchronizing signal becomes zero to a first order 
approximation. 

9.3 Noise in Oscillators 

For the discussion of noise, it is necessary to evaluate the integrals on 
the right-hand sides of (9.7) and (9.8) for the noise voltage e{t). As a simple 
noise model, let us assume that e(t) consists of a large number of elementary 
pulses each of which is represented by sS(t - t 0 ), where e gives the strength 
of the pulse and t 0 the time of its occurrence. Both s and t 0 are independent 
random variables from one pulse to the next. Using (11.20) in Appendix II 
the autocorrelation function of e(t) is given by 

K«W = n<e 2 >5( t) (9.26) 

where n is the average number of pulses in a unit time, and <e 2 > is the 
ensemble average of the square of the pulse strength e. From (9 26) the 
power spectral density of e(t) is calculated to be 

\ e (f)\ 2 = n <£ 2 > = \e\ 2 (9.27) 

which is independent of frequency. When the spectral density is independent 
of frequency, the noise is said to be white. The thermal noise in resistors and 
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the shot- noise in vacuum tubes and p-n junctions are both considered to be 
white for most practical applications. 

Since Eqs. (9.7) and (9.8) are linear, the principle of superposition holds. 
Let us first consider the effect of just one pulse represented by e<5(r — / 0 ); 
the integral 



2T 0 1 f e(t) cos (cot + <p) dt 
J t -To 



(9.28) 



becomes a rectangular pulse with constant height (2 e/T 0 ) cos(w/ 0 + cp) and 
the length T 0 as shown in Fig. 9.5. Since T 0 is considered short for any 



e(t) 



«S ( t -tj 



€ COS(w t o f <£) 




etucos (o»t + (£) dt 



Fig. 9.5. The relation between 27 ^ ' j t _ To e(t)cos(a>t 4- <fi) dt and e(t) for a single 
pulse in e(t). 

variation in A, this pulse can be considered as another pulse having the 
form of a (5-function with strength 2e cos (cot 0 + cp) located at t = t 0 for the 
practical purpose of calculating the effect using (9.7). If n x (t) indicates the 
total effect of the integral (9.28), n l (t) consists of many such pulses. Since 
the autocorrelation function of n x (t) is easily calculated to be 2 «<e 2 > 
we have 

l«i(/)| 2 = 2k| 2 (9.29) 






Similarly, indicating the effect of 



by n 2 (t), we obtain 



2T o 1 e(t) sin (cut + cp) dt 
J I — Tq 



i «2 (/)( 2 = 2 kr 



(9.30) 



Furthermore, because of the orthogonality between sine and cosine func- 
tions, the cross-correlation between n x (t) and n 2 (t ) is calculated to be zero. 

We are now in a position to investigate noise in free-running oscillators. 
In the following discussion it will be assumed that r = 0 in order to simplify 
some calculations which otherwise become very lengthy. 

For the amplitude fluctuation we write AA = A- A 0 , and from (9.7) 

2 L (d AAjdt) + sR 0 A A = «, (f) 

Calculating the power spectral densities of both sides and dividing by 4cu 2 L 2 
+ S 2 R 2 0 we obtain 

\AA(f) | 2 = K(/ )' 2 



4co 2 L 2 + s 2 R 0 2 4 co 2 L 2 + s 2 R 0 : 



(9.31) 



If the output of the oscillator is detected by an envelope detector, we expect 
to observe the output spectrum given by (9.31). Strictly speaking, part of e(t) 
comes from the noise source in R 0 which introduces a slight modification in 
the discussion of the output power measurement as was done in Section 9.2. 
However, for most cases the difference may be negligible. 

For the phase fluctuation, setting r = 0 and cu = w 0 \ (9.8) becomes 



which gives 



- 2 L ( dcp/dt ) = n 2 (t)/A 0 

I (/) | 2= K(/)l 2 

4 co 2 L 2 A 0 2 4 to 2 ISA 2 



(9.32) 



If the output of the oscillator is detected by an FM discriminator following 
an ideal limiter, the output should be proportional to dcpjdt which has a 
white spectrum in this ideal case of r = 0. 

An ordinary spectrum analyzer shows the spectral density of the output 
power directly. The best way to calculate it may be to calculate the auto- 
correlation function of the current waveform and transform the result to 
the spectral density by a Fourier transformation. Since n x (t) and n 2 (t) have 
no correlation, A(t) and ip(t) are uncorrelated, and the autocorrelation 
function of R t ( t) becomes 
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Ri( t) = (A(t)A(t + t) cos(<u 0 r + (p(t )) cos (ct> 0 ^ + co 0 x + <p(t + x))> 

= | ( V + (*)) COS &> 0 t <cos (<p (t + t) - <p (/))> (9.33) 

where < > indicates the ensemble average and use is made of <sin(<p(f + t) 
-*(*))> = 0. 

Once the power spectral density of AA is available, the calculation of 
R aa {x) is straightforward. From (9.31), we have 



\AA(f)\ 2 ^ 

J - 00 



( S ^° I I 

eX P 



(9.34) 



Since the calculation of <cos(«p(f + t) — <p(f ))) is involved, let us first con- 
sider + t) — (p(t)) 2 ). If h(z) is unity from z = — x to z = 0 and is zero 
elsewhere, then referring to Fig. 9.6, x(f) = <p(f + x) — (p{t) can be expressed 



d(p ( 0 ) 



u(t-6) dO 



-i y 2 (e)u«-o) d 0 



(9.35) 



u <z) 




The Fourier transform X T (f) of x r (f) (see Appendix II for the definition of 
X T (f) and .v T (f)) is therefore given by the product of the Fourier transforms 
of n 2T {t) and u(t) multiplied by —\I2LA 0 . The power spectral density of 
x(t ) is given by 





(9.36) 
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From (9.36), ^(0) = <(„(, + 1 ) - ,(r))>> can be calculated as follows: 

*,(0)=f \xWdf-Wr L-cos. H 2 _|r| 

LA o 2 J-. CO 2 1 L 2 A 0 2 2 (9 ' 37) 



where we have used 



r i 

J — co 



1 — COS COT 



f 03 f ,r| sinco0 

dOdf 

J - <x> J 0 co 

r°° Hd ro 

= 2 J-oo Jo J - cosco ^ d ^ ded f 

/»|r| /•$ poo 

= i cos co£dfd£dd 



dddf 



Noting that <p(t + T)-<p(t) has the Gaussian distribution and that 
Llows- + ^ ~ WC Can n ° W calcuIate < cos (<P it + t) - <?(/))> as 

< C0S -*(»))> = £ (2 no 2 )-' 12 «p(- cot <pd<p 

= ex P (-y) = exp(-^|r|) (9.38) 

Substituting (9.34) and (9.38) into (9.33), we obtain 

*‘ (T> = 2 { V + 2-fk eXP (“ 2l |T| )} “P(- ^ W) 

The power spectral density of /(f) can now be obtained by the Fourier trans- 
ormation of the autocorrelation function. Noting that ji? 0 /2L > \e\ 2 /4L z A 2 
we have 0 ’ 

l>(/)l 2 = J R,(T)e-*"di 

= £*[{(--■*>* + (J;y } + {<»+ “o) 2 + (^ o - 2 ) 2 }“] 



4L 2 A 0 ‘ 



■ [{<“ - ®o ) 2 + (^j 2 } '+{(»+ « 0 ) 2 + gyj" ‘j 
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Neglecting the contribution from the noise source in R 0 , 2R 0 \i(f)\ 2 gives 
the output power spectral density function P(f) (/> 0) for the oscillator. 
The factor 2 in front of R 0 \i(f)\ 2 comes from the fact that both the positive 
and negative parts of the frequency spectrum contribute to P(f). Since 
co — (d 0 < co + to 0 , we have 



™-i£" 

* {(fi) - cu 0 ) 2 + (co 0 2 /4Q e 2 xt ) 2 (N/P 0 ) 2 + {<o- a>o) 2 + ((o 0 IQ ext ) 2 (s/2) 2 } 

(9.39) 

where N is given by 

N = \e\ 2 l2(R-R i ) (9.40) 



It is interesting to note that if an amplifier were built using the same negative 
resistance (R - /?,) with the same noise voltage e(t), the optimum noise 
measure obtainable would be given by 




Mo Pt = (WTd 



where k is the Boltzmann constant. 

In (9.39), P(f) consists of tow terms, the first is called the FM noise 
since a spectrum analyzer essentially shows this spectrum if the output of 
the oscillator is displayed after eliminating the amplitude fluctuation by a 
limiter. The second term is called the AM noise. Since the relation 

bo 0 Q7i (NIP 0 )<s 

usually holds, the first term predominates over the second when co is close 
to o)q. F©r large co - c o 0 , however, both terms give equal contributions. 

Let us next consider the noise in oscillators synchronized by injection 
locking. In the case of free-running oscillators, cp could take almost any 
value; when synchronized, however, cp should stay in the vicinity of <p 0 
because of the restoring force due to the synchronizing signal. Writing 
A — A 0 + AA 0 + AA, cp = (p 0 + Acp, and assuming AA, AA 0 , and Acp are all 
small and r = 0, Eqs. (9.7) and (9.8) become 




2 L (d AAjdt) + sR 0 AA + a 0 Acp sin (p 0 = (l) 

- 2 L (d Acpjdt ) - (floMo) A<P cos cp 0 = {n 2 ( t)!A 0 } 



• ;V ' 




I 

■%. 
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respectively. From these equations, we obtain 



\AA(f)) 2 = - -1%^- - + - -° 2 l in ^o_ _Mf)\ 2 

4co I + s R 0 2 4co 2 L 2 + s 2 R 0 2 4to 2 L 2 A 0 2 + a 0 2 cos 2 cp 0 

_2Jel 2 4 (o 2 L 2 A 0 2 + a 0 2 

4co 2 L 2 + s 2 R 0 2 4m 2 L^4 0 2 '+^ 2 TOs 2 7o 

\Acp (f)\ 2 = M/))l __ _ 2 \e\ 2 

4co 2 L 2 A 0 2 + a 0 2 cos 2 cp 0 4 co 2 L 2 A 0 2 + a 0 2 cos 2 cp 0 ^ 9 ' 42 ' ) 



(9-41) 



It can be seen by noting the presence of a 0 that the phase fluctuation is 
considerably reduced, but the envelope fluctuation becomes slightly larger 
with synchronization. 

In order to calculate the output power spectral density function P(f) 
let us first calculate *,(t). Noting that both AA and Acp are small, we have 



^'( T ) - H A o + AAq) 2 (1 - <d(/) 2 »cosm 5 T 

+ -jA 0 2 R Atp (x) cos co s x + ±R AA ( t) cos co s t 
+ Uo s, ncy s T «JA( t) ^^(0)) - <AA(Q) A<p(t)y) 



From this, we can obtain the power spectral density of i(t) by a standard 
procedure. If we neglect the contribution from the noise source in R 0 and 
calculate the power emerging from the oscillator following the method 
described in the end of Section 9.2, we obtain 



N o> 0 1 \ Nco o 



4 {PoPsY 12 Q ext cos cpj 4 Q e ~ 



Hf-L) 



+ - ]Sf ) 

2 &L l (rn - a>,y + M(0 0 |L, COS 2 <p 0 



_ 1 +|^®olL 1 

(“ - ( °.) 2 + (“o/6e„) 2 m 2 (co - a> s f + Mt 0 0 | 2 IX cos 2 Vo 

°X 2 \4(Op Imax sin (p 0 

(ft) - cof + K/fi„J 2 (s/2) 2 (ft) - co,) 2 + Mo> 0 | 2 ax co? ^ 



(9.43) 



where P is given by (9.25). The first term is a pure spectrum. The second 
term represents the noise component which in turn consists of three terms. 
The first term gives the FM noise, the second term corresponds to the AM 
noise, and the last term represents the interaction between the FM and AM 
noise. As can be seen by comparison of (9.39) and (9.43), the FM noise is 
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of the same order of magnitude as in the free-running case when (a> — oj 4 ) 2 
is large. However, as co approaches co s , a considerable improvement is 
obtained by synchronization compared to the free-running case (9.39). 
On the other hand, the AM noise is not improved by synchronization. As a 
matter of fact, since cos 2 (p 0 is smaller than unity, the AM noise increases 
slightly. If cos 2 <p 0 becomes small, i.e., if the synchronizing frequency is 
near the edge of the synchronizing range, both the FM and the AM noise 
become extremely large. In the limit of cos 2 (p 0 approaching zero, however, 
the assumption of A A and Aq> being small is no longer valid, and the above 
result should not be applied. 

The interaction term shifts the noise from one side of co s to the other if o s 
and ojq do not coincide, and as a result, the noise spectrum becomes un- 
symmetrical with respect to w s . The sign of the interaction term is such that 
the (o 0 side of w s always becomes noisier. 

So far the contribution from I// noise has been completely neglected. 
In practice, the circuit parameters of the active device are expected to fluc- 
tuate slowly with time, and the spectra of the fluctuations will be nearly 
proportional to 1 //, according to many experiments. This type of noise does 
not play a significant role in the case of ordinary high frequency amplifiers. 
However, in the case of oscillators, the situation is different. As shown in 
Section 9.1., the behavior of the amplitude and phase of an oscillating 
current is governed by (9.7) and (9.8). In these equations, R and X are 
expected to fluctuate with time. Concentrating on (9.7), if R fluctuates due 
to the fluctuation of the device resistance, then transfer the fluctuating part 
d^to the right-hand side. If we write AR(t) = n R (t)jA, n R (t) has the dimension 
of voltage, and the equation becomes 



(l H — 2 ^,'j — + (^i + — A = — - f e(l) cos (co + <p) dt + n R (t ) 

V a> CJ dt ToJi-to 

(9.44) 

Similarly, (9.8) becomes 



\ coC ) \ to 2 Cj dt 



e{t) sin (cot -f <p) dt + 



(9.45) 



where n x (t ) = A AX(t). | 

Equations (9.44) and (9.45) are the fundamental equations for A and <p | 
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when 1// noise is taken into account. The terms \n R (f)\ 2 and \n (/) | 2 are 
both proportional to 1// In general, n R (t) and n x {t) are independent of 
« x (/) and n 2 (t), the white noise we previously introduced. However, n R (t) 
and n x (t) may be cross-correlated with each other, and this makes the 
calculation of |/(/)| 2 complicated even when r = 0. On the other hand, 
the spectrum calculation of phase and amplitude fluctuations remains 
straightforward. For instance, in the case of free-running oscillators the 
results for \AA(f)\ z and |<p(/)| 2 remain valid if 2\e\ 2 is simply replaced by 

2 1*1 + and 2 \ e \ 2 + M/)| 2 , respectively. It is worth noting that the 

spectrum of frequency fluctuation \dq>(f)/dt\ 2 is no longer white. Depending 
on the relative magnitudes of 2|e[ 2 and \n x (f)\ 2 , the spectrum exhibits a 
certain slope, with the steepest region being about 3 dB per octave. 

The calculated result of the phase fluctuation for synchronized oscillators 
also remains valid if 2\e\ 2 is replaced by 2\e\ 2 + \n x (f)\ 2 . Phase fluctuations 
can therefore be greatly reduced by synchronization, even if 1// noise is 
taken into account. The amplitude fluctuation contains a new interaction 
term between n R {t) and n x (t), and whether or not this fluctuation is reduced 
by injection locking depends on the correlation between the two. If they are 
uncorrelated, the amplitude fluctuation will be increased by injection 
locking as before. 

PROBLEMS 

9.1 Assuming that the reactive part of the device impedance is a function of the current 
amplitude, draw a Rieke diagram corresponding to Fig. 9.4(a). 

9.2 Calculate the angle with which the constant power and constant frequency contours 
intersect each other at the operating point in the previous problem. 

9.3 Prove that generalized functions corresponding to two different continuous functions 

are not equal. Use defined by (11.16) in Appendix II to test the equality. 

9.4 Calculate the autocorrelation function of A cos {cot + <p) and its power spectral 
density. 

9.5 Show that the power spectral density of shot noise current is given by 

l*(/)|* = el 0 

where /o is the average current and * is the magnitude of electronic charge. Note 
that this gives (7.71) when the contribution from the negative frequency part of the 
spectrum is taken into account; i.e., </„ 2 > = 2|/(/)| 2 B. 

9.6 Suppose that the synchronizing signal is noisy and represented by 

(ao + Jo)cos(w 5 / + v/), where Aa and v indicate the amplitude and phase flue- 
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tuations, respectively. Show that the power spectral densities of the amplitude and 
phase fluctuations of the synchronized oscillator are given by 

\AA{f)\* = Ro 1 {(*o) 2 Q\ xt + W}- 1 {« 2 + Mrool^^cosVo}- 1 

X {|Jwo|^ ajc cos 2 <polPs sin 2 po|^(/)| 2 4- (« 2 + M"o|^ ax ) N} 

2 C 2 (O 0^ N i 

I (<P ~ 9 > o )(/)[ 2 “ {« 2 + MooUaxCOS 2 ^}- 1 ]M<yo| ma x COS 2 <po\v(f )\ 2 + — — [ 

' ext 

where it is assumed that Po > Ps, ao > Aa, and r = 0. 

9.7 Show that |zM(/)| 2 and |p(/)| 2 of a free-running oscillator become 
|zM(/)| 2 = NR~ l {(ru/wo) 2 G 2 exl + (s/2) 2 }-* 

\vU)\* = ( N/Po ) (2Q 2 ext (eo/coo) 2 }" 1 [1 + (r/2) 2 {(cu/cuo) 2 Q* xt + (s/2) 2 }-i} 



when the nonlinearity of the device reactance is taken into account, i.e., r / 0. 







appendix I 

PROOF OF lim L 



In this appendix we shall prove the infinite growth of the eigenvalues 
which we used in the discussion concerning the completeness of eigen- 
functions. 



1. One-Dimensional Case 
The proof will be presented in several steps. 

Lemma 1. Let /and g be single-valued real functions of x, then 

j f 2 dx j g 2 dx ^ fg dx) (U) 

where the integrals are from x = a to b (b > a). 

Proof. Noting that the square of a real function is positive, we have 

0 ^ J (f j 9 2 dx - g j fg dx^j dx 

= S fl dX (I g2 dX ) ~ 2 {j fgdx ) j e 2 dx + j g 2 dx (j fg dxj 

” I 92 dX {j f2 dX j g2 dx ~ (/ fg dx ) 2 \ 

Since the integral of g 2 is positive, the terms inside the brackets on the right- 
hand side must be positive and the proof of (1.1) is complete. 
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Lemma 2. Let / and g be single-valued real functions of x, then 






dx < 2 (\ f 2 dx + 






where the integrals are from x = a to b(b > a) 
Proof. From 



we have 



0 < J (/ ± g) 2 dx = j f 2 dx ± 2 J fg dx + J g 2 dx 

2 j fg dx\*Z j f 2 dx + j g 2 dx 



Substituting (1.3) into 

J (/ ± df dx < J f 2 dx + 2 | J fg dx | + j g 1 dx 

we obtain (1.2). 

Lemma 3. Let / be a single- valued function of x whose derivative is 
square-integrable over L. 

Let us divide L into N sections and indicate the fth section by L t . Let 
/(/) be the average of / over L h then 

/(i) = L7 1 f fdx 

J L, 

A staircase function can be defined over L such that its value in each L { is 
equal to /(/). If this staircase function is indicated by f then 



j -ffdx^K j JdfldxY 



where AT is a constant which approaches zero as JV->oo and max 0. 

Proof Since / is continuous, its value becomes equal to /(i) somewhere 
in L x . Let x 0 be such a point, then we have 



/“/( 0= f ( dfjdx) dx 
J Xn 



in L|. Squaring both sides and applying (1.1) with / being replaced by 
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dfjdx, and g by I, we obtain 

{/-/(0} a «ff (df /dx) dx 



f dx f ( df/dx ) 2 dx ^ L; f ( dfldx f dx 

W XQ J Xq J L/ 



It follows from this that 



Id'-" 



dx ^ L maxL 



.• j L (df/d X y 



Trr lv° ES *7°°’ then L maXZ,< a PP roaches zero. Writing L maxL,. 
as K, (1.4) is proved. 

Lemma 4. An infinite sequence a 2 ,. ... sat i s f ying 
K|<C (n — 1,2,...) 

contains at least one subsequence which converges to a finite constant The 
constant C m the above inequality is fixed and independent of n. 

Proof. Bisect the section from - C to + C at its center and classify the 
a s into two groups, one satisfying - C a a „< 0, and the other satisfying 

5 ‘ eaS ‘ ° ne of the two S rou P s should contain an infinite number 

o a„ s Again let us bisect the section containing this group at its center 
len at least one of these two sections contains an infinite number of a ’s’ 
as before Continuing this process indefinitely, we see that there is a constant 
a somewhere between - C and + C such that ot + a contains an infinite 
umber of s no matter how small a is. Let a(m) be the first a „ to appear 
within the range a + (1/m) where m is a positive integer. By increasing m 
sequentially, an infinite sequence ...,a(m),a(m + 1),... will be obtained. 

a particular a(m) happens to be the same a n as previously used for a(m - /) 
this aim) will be omitted. The sequence thus obtained is a subsequence of 
e a„ s, and it converges to a. This completes the proof. 

satisfying 11 ' 3 A SeqUenCe ° f funCtions f " = ’■ 2 °°>, each 

//” 2 dx < c • (dfjdx) 2 dx^C (l. 5) 

contains a subsequence such that 



lim f (fn ~fmY dx =0 

J L 
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n 



Proof. The proof will be carried out in two steps. Divide L into N 
sections as we did in Lemma 3, and let / be the staircase function whose 
value within L t is equal to /(/). In the first step, we shall prove that 

lim f {(f.-fJ-(L-Lf)dx ^0 (1.6) 

N-*co J 

where maxL f -»0 is assumed as N-> co. 

For simplicity, we write 



*P ~ fn f mi *P fit f n 



then we obtain 



j J ( <p 2 - (p 2 ) dx < J | (p 2 - (p 2 1 dx J \(p + (p\\<p- (p\ dx 
< | J (<p + (py dx j o - (py dxj 



where use is made of (1.1) for the last inequality. For the first term in the 
square root, Lemma 2 gives 



J (up + (p ) 2 dx < 2 (p 2 dx + J <p 2 dx'j (1.8) 

To evaluate the upper bound of the right-hand side, each term is calculated 
using (1.2) and (1.5) as follows: 

J <p 2 dx = J (/„ -/J 2 dx i 2 J (/„ 2 + f m 2 ) dx < 4 C (19) 

rf* = J (/„ - /J 2 dx = l L,{/.(i) - /„(i)} 2 <21 L, {/„ 2 (0 + /, 2 (0} 

(1. 10) 

For the last inequality, we used the relation (A ± B) 2 ^ 2(A 2 + B 2 ) which 
is equivalent to 0 < (A ± B) 2 . From the definition of /„(/), we have 

E UK (0 = E £ i ((-7 1 j L L d f « E W l ) 2 U / /„ 2 * 

= J f n 2 dx^C (I- 11 ) 

where (1.1) and (1.5) are used. Combining (1. 10) and (1.11), we have 

f ip 2 dx 4C (U2) 
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From (1.8), (1.9), and (1.12), we obtain 

J (q> + (py dx < 16C (1.13) 

Similarly, using (1.4), we have 

J (<P — <p) 2 dx ^ K j ( dipjdx ) 2 dx 

K j {{dfjdx) 2 + (dfjdx) 2 } dx < 4KC (1.14) 

Substituting the right-hand sides of (1.13) and (1.14) in the square root of 
(1.7), we obtain 

j (<p 2 -<p 2 )dx^SjKC 

If max L t -* 0 as N-*co, then K—>0 and (6) is proved. 

In the second step, we prove the existence of a subsequence of the /„’s 
such that 

f (/„ —fm) 2 dx->0 (1.15) 

m, n-»co J L 

A combination of (1.15) with (1.6) then gives Rellich’s theorem. 

We first note that 

f (/.-A) 2 d.x= £ M/.(0-/.(0} 2 
J L i = 1 

= E (U ,2 /.(0 - l'/VMV (i.i6) 

i=l 

and from (1.1 1) 

4 /2 l/„(0l<Vc 

By Lemma 4, there is a subsequenceof the /„’ s for which Lj /2 /„(l) converges. 
Again by Lemma 4, using the members of this subsequence only, we can 
make another subsequence for which Lj /2 /„( 2) converges. Repeating this 
process, we obtain a subsequence of the /„’ s for which Lj /2 /„(/) converges 
for every /. For this subsequence, the right-hand side of (1.16) approaches 
zero as m, «-> co, i.e., (1.15) is satisfied. The proof of Rellich’s theorem is 
thus completed. 

Proof for lim n _ Q0 /c n 2 =oo. Let E yn be the wth eigenfunction which is 
normalized then the corresponding eigenvalue k„ 2 can be expressed in the 
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If k„ 2 remains finite, then the E yn ' s satisfy the condition for Rellich’s 
theorem, and there must be E yn and E ym satisfying 

J ( E yn - E ym f dx^e 

no matter how small s is. However, the left-hand side is equal to 

J Eym dx -2 j E yn E ym dx + | E) m dx 

in which the first and third terms are both equal to 1 , and the second term is 
equal to zero from the orthogonality condition. This means that 
f ( E yn — E ym ) 2 dx cannot be less than 2 leading to a contradiction. We 
conclude from this that the assumption of k„ 2 remaining finite is wrong, or 
equivalently, that Yvm^^k 2 = co. 



2. Two-Dimensional Case 

Lemma 1 . Let / and g be single-valued real functions of * and y, then 

J f ds | g 2 dS > fg dsj 

where the integrals are over a certain area S. 

Lemma 2. Let / and g be single-valued real functions of x and y, then 

j(f±gf ds «S2 (J / 2 dS + J g 2 dS^ 

where the integrals are over a certain area S. 

The proofs of the above two lemmas are similar to those for the one- 
dimensional case, and hence will not be presented. 

Lemma 3. Let / be a single-valued real function of x and y having a 
square-integrable derivative V / over S = Let /(/') be the average of / 

over S h 

fdS 




m=sr f 

J s, 
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and define the staircase function / whose value is equal to /(/) in S t for 
every i. We then have 

f (f-ffdSUK f (VffdS 
J s J s 

where K is a constant, and it is possible to choose the S/s so that K ->0 
when every S t -» 0. 

Proof. Suppose that S t is an area enclosed by x = 0, x = a, y = 0 
y = g(x) such that the maximum value of g(x) is smaller than h, and the 
minimum value is larger than t as shown in Fig. A.l. Let R be the shaded 




Fig. A.l. Domain Si and the integral path PoP. 



area below t, then there is a straight line M parallel to the x-axis and located 
in R such that 

r f (Vff dx=[[ (V/) 2 dx dy (1.17) 

J M J J R 

Such a line M exists since the right-hand side can be written in the form 

f f(V /) 2 dxdy 
J oj o 

where the integral with respect to x gives a continuous function of y which 
can be integrated from y = 0 to t. There should be at least one point some- 
where between y = 0 and t at which the value of the continuous function of 
y becomes equal to the average value for this range of y. Let us draw a 
straight line M parallel to the x-axis through this point, then the left-hand 
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It 

side of (1.17) becomes the average value times t , which is equal to the 
right-hand side. 

Let / be a unit vector, then the value of 

V/ • I = df I dl 

becomes maximum when the directions of I and V/coincide. Otherwise, the 
value becomes smaller than the magnitude of V/. It follows from this that 

{dfldlf < V/ - V/ - ( dfldx f + {dfjdyf (1.18) 

Let P 0 be a point where / takes the value of /(/) in S„ then the value of /at P 
in Sj is given by 



from which we have 



/ = /(/)+ f (dfldl)dl 

J Po 



{/-/( 0} 2 = f Widi)di f (dfidi)dKL f {dfldlf dl 
J Po j Po J Po p 

< L f miSxf + {dfjdyf} <H = L f (V/) 2 dl 
J Po j Po 

where L is the path length from P 0 to P and the relations (1. 1) and (1.18) 
are used. If we choose the path as shown in Fig. A.l, and integrate both 
sides over S £ the following inequality results. 

f {/ - f{i)f dsJ L f ° (V/) 2 dy dS + I l( (V/) 2 dy dS 
J Si J Si J O J Si J 0 

+ f l| ( VffdxdS (1.19) 

J Si J M 

Using the inequalities 

C r-P fh fa fg{x) r 

(VffdydS^l (V/) 2 dydxdy^h I (V/) 2 dS 

J Si J 0 Jo Jo Jo J St 



f f (v/) 2 cJx^s^f r 1 f (v/) 2 dsds<ahr l f (v/) : 

J Si J M J Si J Si J Si 



Eq. (1.19) becomes 



J {/- /(/)} 2 dS « ( 2 fc + a ) ( 2 / i + ^ r ‘) J s (V/) 2 rfS ( 1 . 20 ) 



where L ^ 2h +a is used. 
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Let us divide the given area S into a large number of small areas, each of 
which satisfies the condition explained in connection with Fig. A.l; any 
rotation or translation of the coordinates will be allowed. Suppose that t 

of each area is kept equal to or larger than a constant a times h, i.e t> a h 
then ’ * 

f s (/ - /) 2 = i J s (/ - mv ds 

< £ (2 h + a) (2h + act - ') (V/) 2 dS < K J (V/) 2 dS 

and K approaches zero as we decrease the maximum size of S { . This com- 
pletes the proof. 

Rellich’s theorem. A sequence of functions /„ ( n = 1, 2, co), each 
satisfying 

jjfdsac, jW'fds^c 

contains a subsequence for which 

lim f (f.~f„fdS = 0 

m,n — < x J S 

1 he proof for the one-dimensional case will serve here almost word for 
word and hence we shall not repeat it. 

Proof for hm B _ C0 Ar„ 2 = oo. Let E, be a normalized eigenfunction, then 
the corresponding eigenvalue k 2 can be expressed in the form 

* 2 = J (V x E,) 2 + (V • E,) 2 dS 

which is equivalent to 

1(2 = J (E f • V x V x E, + V.(E, x V x E f ) + V.(E t V-E r ) 

“ (E f » V) (V • E,)} dS 

The integrals of the second and third terms in the brackets can be converted 
into line integrals, both of which disappear because of the boundary con- 
ditions for E,. The first term can be decomposed into two terms using the 
universal equality 

V x V x E, = V (V • E,) — V 2 E f 
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the first of these, when multiplied by E„ is equal to the last term in the 
integrand. Thus, we are left with 



~J E - 



= - £ 



V 2 E, dS 

d 2 E x d 2 E x d z E v 8 2 E v 

i^ +e '- S 7 +e ^ +e ’W 



Using 



d 2 E x d dE x ( dE x \ 2 

dx 2 dx x dx \ dx ) 



and similar equations, (1.21) can be rewritten in the form 

k 2 = J «V £,) 2 + (V£,) 2 } dS - l 2 J (E x 2 + E/) + ~ {E , 2 + £/)| dS 



= J {(V£,) 2 + (V£ y ) 2 } dS - l - J V • V (E, • E,) dS 



The second integral on the right-hand side can be converted into a line 
integral which disappears for the following reason. On the boundary 
V -E, = 0, and in terms of the n and l components this becomes 

(dEJdn) + (dEJdl) = 0 

Since E, is equal to zero on the boundary, we have ( dE t jdl ) = 0 and hence 
(dEJdn) = 0. The integrand of the line integral is given by 

n. V(E,-E,) = {a(E„ 2 + E?)ldn} = 2 {E n (dEJdn) + E,(dEJdn)} 

which is equal to zero since dEJdn and E t both vanish on the boundary. 
Equation (1.22) now reduces to 



k 2 = J {(VE,) 2 + (VE y ) 2 } dS 



Let E xn and E yn be the x and y components of the nth eigenfunction E,., 
then from the above expression of the eigenvalue, both the E X J s and the 
E y J s satisfy the conditions stated in Rellich’s theorem if the kj's remain 
finite. Consequently, we can choose a subsequence of the E,„’s for which the 
integral of (E xn — E xm ) 2 converges to zero as n and m grow indefinitely. Let 
us choose from this subsequence another subsequence for which the integral 



3. Three-Dimensional Case 



409 



of (E yn E ym ) 2 converges to zero, then for such a subsequence, we have 

/ (E '” - E “> 2 dS = „ lim / {( £ .» - E* m ) 2 + (£,. - EJ 2 } dS = 0 

However, this contradicts the orthogonality and normalization conditions 
tor the E,„’s, which means that the assumption of the kj's remaining finite 
is wrong. This completes the proof of lim n ^^kJ = oo. 

3. Three-Dimensional Case 

Lemmas 1, 2, 3, and Rellich’s theorem in the three-dimensional case are 
a identical to those in the two-dimensional case except that V replaces S 
everywhere. However, we have to use Fig. A.2 instead of Fig. A.l to prove 




Fig. A.2. Domain Vi and the integral path P 0 P. 



Lemma 3. Here, the maximum value of g( x , y) i s less than h and the mini- 
mum value is larger than ( > 0. Also, V, expresses the volume under r, and 
the plane S t is such that 

dv 



f (\ffds = ,-'[ (y f y 

J S > J V, 
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The line M on S t is parallel to the x-axis and satisfies 

f (Mf) 2 dx = b~ 1 f (Vf) 2 dS 

J M J S, 

If the integral path from P 0 to P is taken along the bold line shown in 
Fig. A. 2, corresponding to (1.19) and (1.20) we have 



j y {/-/(/)} J o °(V/) 



f f (\f ) 2 dz dv + [ f(V /) 2 

J Vi J 0 J Vi J 0 

f fm 2 dydv+( f (\fY 
J Vi J 0 J V, J M 



^(2h + 2b + a){2h + 2bht~ 1 +abhb~ 1 t~ 1 } f (V /) 2 dv 

J v, 

If we divide the given volume V into a large number of smaller volumes 
similar to Fig. A. 2 while keeping t ^ ah, then 

if - /«} (V /) 2 dv 

and K -* 0 as the maximum size of the small volumes approaches zero. The 
proof of lim„_ c0 /r n 2 =oo is now obvious from the previous discussions 
for the one- and two-dimehsional cases. 



4. Resonant Cavities with Inhomogeneous Media 

As we discussed in Section 4.5, the eigenfunctions for a resonant cavity 
with an inhomogeneous medium can be classified into three groups, I, II, 
and III. We shall first show that the number of independent functions in 
group I is equal to that of independent functions in group I for the same 
cavity with a homogeneous medium. We shall then show that the eigenvalue 
of the /th eigenfunction in group II cannot be smaller than that of the /th 
eigenfunction in group II for the same cavity with a homogeneous medium 
whose relative dielectric constant and permeability are given by e M and fiu, 
respectively, where s M and f.i M are the maximum values of e r and \i r in the 
inhomogeneous medium. Finally, we shall show that the eigenvalue of the 
/th eigenfunction in group III cannot be smaller than that of the /th eigen- 
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function m group III for the same cavity with a homogeneous medium whose 
relative dielectric constant and permeability are given by e m and (i m respec- 
tively, where e m and Min are the minimum values of and fi r . Since the 
eigenvalue k n for a cavity with a homogeneous medium grows indefinitely 
„ for the cavity with an inhomogeneous medium must also grow indef- 
initely with n, which completes the proof. 

Now suppose that E represents an eigenfunction in group r then it 
satisfies V x E = 0 in F and n x E = 0 on S, where V indicates the volume 
m the cavity, and S the wall surface. By Helmholtz's theorem, E is expressible 
as V, p. Substituting this into V. e ,E = 0 and nx E = 0, we have 

V -ny, p = 0 (in V), n x V<p = 0 (on S ) 

This boundary condition requires that <p is constant on each independent 
wall, and the problem is reduced to that of finding static potential functions 
in the closed region. Let N be the number of independent walls, then there 
are exactly N - 1 independent solutions for <p since there are only N- 1 
independent ways of assigning the potential to N - \ walls with respect to 
the remaining one, whether or not the medium is homogeneous. This 
completes the discussion of group I. 

Let us next consider E in group II. We shall discuss the problem in two 
Steps. In the first step, we shall show a maximum-minimum property of 
eigenvalues. In the second step, we shall compare the magnitudes of the 
/th eigenvalues for the homogeneous and inhomogeneous cases. 

For the functions in group II, V-e r E = 0 and we have 

V x Mr ‘V x E — Ar £ ,E = 0 (in V), nxE = 0 (on S) (1.23) 

If we restrict ourselves to the class of functions which satisfy n x E = 0 on 5 
and do not contain a term expressible as the gradient of a scalar function 
the variational expression for the eigenvalue of (1.23) is given by 




e eigenfunctions m group III can all be expressed as the gradient of some 
scalar function, and hence they are excluded from the present E’s. Let 
be piecewise-continuous, but otherwise arbitrary functions, 
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and assume that E satisfies 



Let us define D( E) by 



J e,E • F ; tfu = 0 (UUZ- 1) 

J £ r E 2 dv = 1 
by 

D (E) = J x E) 2 dv 



then the value of Z>(E) varies with E. Let D be the minimum value of Z)(E) 
when E varies over all possible functions satisfying (1.24) and (1.25) with 
the F,’s being fixed. If the F,’s change, D of course changes. Let us try to 
find the largest value of D when the F/s are allowed to vary over all possible 
functions. We first consider a function in the form 

E = £ c,E, (1.26) 

i= 1 

where E l5 E 2 ,..., E, are the first / eigenfunctions in group II. From the 
conditions (1.24) and (1.25), all the cfs are determined. If i # j, the expression 

J /i r _1 (V x E,).(V x E j) dv = J x V x E j dv 

+ J E f x n ~ 1 V x E j • n dS 

is equal to zero, since the first term on the right-hand side is equal to 
J k j 2 e r E i ''Ej dv which vanishes because of the orthogonality condition; the 
second term vanishes because n x E, = 0 on S. Using the above relation, the 
value of Z)(E) for (1.26) can easily be calculated as 

D( E)= £ c,% 2 

i = 1 

However, since k 2 ^ k 2 (1 < / < / — 1) and £ c 2 = 1 from (1.25) and (1.26), 
we have 

0(E)O, 2 

This inequality holds for at least one E given by (1.26) regardless of the 
Fj’s used, which means D ^ k 2 . Now suppose that F t = E ls F 2 = E 2 , •••» 
Fj_i = E,_ l5 then (1.24) gives the conditions under which the variational 
expression A: 2 (E) is minimized to obtain E f . Because of (1.25), the minimum 
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value of /)(E) under these conditons is given by k 2 . Combining with D ^ k 2 y 
we conclude from the above discussion that the largest value of D is k 2 , 
and that the function which gives this value is the /th eigenfunction E,. 
This completes the discussion of the maximum-minimum property of the 
eigenvalues. 

In the second step, we first change ii r to fi M . With the F/s (1 </</ — !) 
and E being fixed, 

0'(E) = J^‘(VxE) 2 dS 

cannot be larger than Z>(E). The minimum value D' of D'( E) cannot 
therefore be larger than D. It follows from this that the /th eigenvalue k’ 2 
in group II for the same cavity with e r and n M is smaller tnan k 2 , thus 

K 2 < k 2 

because the largest value of D' gives k\ 2 . Next, we change e r to e Af . The 
conditions necessary to obtain the /th eigenvalue k" 2 for the cavity with 
s M and n M become 

J * F/ dv = 0 (1 < i < / - 1) (1.27) 

J £ M E 2 dv = 1 (L28) 

If we set F/ = F i (£ r /£ M ), (1.27) reduces to (1.24), in order to satisfy (1.28), 
however, E must be multiplied by a constant c smaller than unity. The 
corresponding Z)(E), which we shall indicate by D"(E), must be equal to 
c 2 D' (E), and hence 

D"{ E) ^ D'(E) 

Let D be the minimum value of Z)"(E) when E varies over all possible 
functions, then D" ^ D'. If we change F f over all possible functions, the corre- 
sponding F/ also varies over all possible functions. However, the largest 
value of D" cannot exceed the largest value of D' because of the relation 
D'^D', which means k" 2 < k[ 2 < k t 2 . We conclude from this that the 
eigenvalue of the /th eigenfunction in group IT, for a cavity with £ r and fi r , 
cannot be smaller than that of the /th eigenfunction in group II for the 
same cavity with e M and fi M . Since the latter grows indefinitely with /, so 
does the former. 

We finally consider E in group III. Since V x E = 0 in V, the eigenvalue 
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problem for E is given by 

\V-e r E + k 2 E = 0 (in V), nxE = 0, V.s r E = 0 (on S) 
Writing V*6 r E as k(p, we obtain 

V ’S r \(p + k 2 (p — 0 (in K), (p = 0 (on S) (1-29) 
For each <p satisfying (1.29), a corresponding E can be obtained from 

kE = — \(p 

Note that nx E = 0 on S is automatically satisfied. If we restrict ourselves 
to the functions which satisfy cp = 0 on S, the variational expression for the 
eigenvalues of (1.29) is given by 



J e r (V<pY 



k 2 (<p) = 



Let ipi- 1 be piecewise-continuous, but otherwise arbitrary, 

functions and assume that ip satisfies 



J (pif/i dv — 0 (1 < i < / — 1) 

J <p 2 dv = 1 



Let us define D(<p ) by 



D((p) = J £ r (Vc/>) 2 dv 



and let D be the minimum value of D((p) when (p varies over all possible 
functions, then an argument similar to the one employed for group II 
shows that the eigenvalue of the /th eigenfunction in group III for a cavity 
with e r cannot be smaller than that for the same cavity with e m . This com- 
pletes the proof for E. 

For H, the number of independent eigenfunctions in group I is equal 
to the number of independent loops formed by the wall surfaces. Apart 
from this, all the arguments are similar to those for E. 



APPENDIX II 

GENERALIZED FUNCTIONS 
AND FOURIER ANALYSIS 



In this appendix, we shall introduce the concept of generalized functions 
in order to simplify our discussion of oscillator noise in Section 9.3, This 
concept was developed from Dirac’s “delta function” <5(x) which satisfies 

1 * 0 0 

8(x) K (x) dx = K (0) 



for any “reasonable” function tf(x). No ordinary function can satisfy the 
above equation. However, we can consider the limit of a sequence of func- 
tions such as (n/ny 12 exp(- nx 2 ) (n = 1, 2, 3, ...). The value of these func- 
tions tends to be zero with increasing n at almost every point except x = 0 
while the area under the curves described by the functions remains equal 
to unity. The following discussion will give this intuitive approach a mathe- 
matical foundation, and not only justify the use of <5(.x) but also greatly 
reduce the complications in Fourier analysis such as the determination of 
conditions for convergence, term by term differentiability, and uniqueness. 

We start with terminology. A function is said to be good if it is every- 
where differentiable any number of times; also, its values and those of all its 
derivatives reach the order of \x\~ N at most as |*| increases, i.e., 0(|*| _w ), 
for all A. A sequence f„(x) of good functions defines a generalized function 

Ax), if 




exists for any good function Ai(x). The integral of the product of a generalized 
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function /(x) and a good function K(x) is defined by 

f f(x)K(x) dx = lim f f n (x) K(x) dx (1 1.1) 

J — CO II - * oo J — CO 

If two sequences of good functions, /„ (a) and g n (x), satisfy 

/•co /*oo 

lim f„(x)K(x)dx= lim g n (x)K(x)dx (II.2) 

B-*CO J —00 IJ— * CO J —00 

for any good function K(x), then the sequences are said to be equivalent, 
and the corresponding generalized functions /(x) and g(x) are said to be 
equal. The equality is indicated by 

/to = *(*) (H3) 

Thus, each generalized function is really the class of sequences all equivalent 
to each other. The above definition of equality is, of course, consistent 
with the definition of integral (II. 1) since the limit on the right-hand side is 
the same for all equivalent sequences (see p. 425 for the physical meaning). 

If two generalized functions /(x) and g (x) are defined by two sequences 
f„(x) and g n (x), then the sum f{x) + g(x) is defined by the sequence 
fn(x) + g„(x). Also, a generalized function f(ax + b) is defined by the 
sequence f„(ax + b), where a and b are constants. The derivative /'(x) of a 
generalized function is defined by the sequence /„'(*). Since K'(x) is a good 
function for any good function K(x), the limit 

lim f fn'( x ) K ( x ) dx = - lim f f„(x) K'(x) dx (II.4) 

n -» co J — oo n-*co J — oo 

exists and is the same for all the sequences equivalent to /„(x). From (II.4), 
we have 

/•CO /• co 

f(x)K(x)dx = - f(x)K'{x)dx (II.5) 

J — co J — 00 

which is consistent with the definition of the equality between generalized 
functions. If two generalized functions g(x) and /(x) are equal,/' (x) = g'(x). 

Let us next consider Fourier transformations. The Fourier transform of a 
good function /„(x), 



F,(y)= r 

J — CO 



is also a good function. This can be shown by differentiating (II. 6) m times 
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and integrating by parts N times as follows : 



( 2 jA m N C°° i d N 

{x m Mx)} dx=0(\y \~ N ) 



The inverse transform of (II.6) is given by 



/.(*)" F,(y)e i2 ™dy (II.7) 

J - oo 

To prove this, let us investigate the magnitude of 

/•OO 

fn(x)~ F„(y)exp(-5 2 y 2 )e j2 * xy dy 

J — CO 

Substituting (II. 6) into this expression and noting that both f n (x) and 
F„(y) are good functions, we have 

/•co /»oo 

fn(x)~ / n (t)exp(- d 2 y 2 )e j2n{x ~ ,)y dt dy 

J — 00 J — CO 

< fn(x) ~ /„(/) expj- (t - x) 2 j dt: 

1 f 03 1 f 00 / <5 \ 

“ -7z | fn W ex P ( — z 2 ) dz j= j fjx + -z)exp(- z 2 )dz 



lx/ 71 J- 






j n v"/ — f /—i 

max|/„'(x)| -z exp(-z 2 )dz 



which leads to (II.7) in the limit of (5-»0. In the above calculation, we have 
used the relation 

/* cO _ 

exp(- z 2 ) dz = Jn (II.9) 

J * co 

which can be derived using a transformation to a polar coordinate as 
follows : 

a co \ 2 /*co -co 

exp(-z 2 )dzj = exp(— x 2 ) dx exp(-y 2 )dy 

“CO / J — oo J — co 

/•CO 

= exp(— r 2 ) 2nr dr — n 
Jo 

Notice that the limits of the integral on the right-hand side of (II.9) can be 
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replaced by — go + jA and oo + jA without changing the value of the integral 
where A is a real constant. 

Let us now define the Fourier transform of a generalized function 
f(x) by the sequence F n (y ), where F n (y ) is the Fourier transform of /„(x), 
then 



f CO 

*W= /W 

J — CO 



while the inverse transform of F(y) becomes f(x) from (TI.7) : 



/(*)= f F(y)e» 

J — 00 



( 11 . 10 ) 



(IUl) 



Notice that the Fourier transforms of f(ax + b) and f'(x) are given by 
|a| -1 e j2nby!a F(y/a) and jlnyF(y), respectively. Furthermore, since 



we have 



/»C0 i»00 fCO 

f„(x)K(x)dx=\ F„ (y) K (x) e J2nxy dx dy 

J — oo J - co J — oo 

= f” F„(y)k(y)dy (0.12) 

f” f(x)K(x)dx=F F(y) k(y) dy (11.13) 

J — 00 J — 00 



where k(y) is the inverse transform of isf(x). 

We are now in a position to define a generalized function <5(x). Let us 
consider a sequence (n/n) 112 exp(— nx 2 ). First, we note that each member 
of the sequence is a good function. Next, since 

f (n/n) 112 exp(- nx 2 ) K(x) dx - K(0) 

I J — CO 

I P 00 

= {nln) >12 exp(— nx 2 ) {K(x) — X(0)} dx 

I J — CO 

max |K'(x)| f (n/n) 112 exp(— nx 2 ) \x\ dx 
J - co 



= (nn) 1/2 max |K'(x)| -*■ 0 



as n — » co, we have 



lim f (njn) U2 Qxp(— nn 2 ) K(x) dx = K(0) 

n~* oo J — co 

for any good function K(x). Thus, sequences equivalent to (h/ti) 1/2 exp( — nx 2 ) 
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define a generalized function. If we indicate this generalized function by 
S (x), we have 

5 (x) K (x) dx = K (0) (11.14) 

- QO 

Since the Fourier transform of («//r) 1/2 exp(- nx 2 ) is exp(- n 2 y 2 /n), the 
Fourier transform of <5(x) is one. 

An ordinary function /^(x) can be considered as a generalized function if 
(1 + x 2 rV 0 (x) is absolutely integrable from - co to oo. To show this, let 
us consider a sequence defined by 

poo 

/„(*) = fo(t)o(nt - nx)nexp(~ t 2 ln 2 )dt (11.15) 

J — co 

where o(z) is any good function which is zero for |z| ^ 1 and positive for 
|z| <1 while j‘_j o(z)dz = \. Such a function is obtained, for example, by 
setting a = - 1 and ft = 1 in \j/ afi (z) given by 

^(z) = ^'cxp{ ( -^i 2+( -r ^ 2 j for a<z <fi 

= 0 for z ^ a or z ^ (11.16) 

where 

A = f exp { — ^ + 7— Ll dz 
j. l(z-«) 2 {z-Pfj 

Note that all the derivatives of i j/ afl (z) exist and are equal to zero if z < a 
or z^fi. In order to show that/„(x) is a good function, we differentiate 
/„(x) m times. Since a (m) ( nt - nx) is nonzero only when |x| — 1 < |/f < |x| + 1, 
we have 

=| f o (0 ( n) m o (m) (nt - nx) n exp(- t 2 Jn 2 ) dt 

< n m+1 max )o- (m) (z)| exp{- (|x| - l) 2 /n 2 } 

poo 

x 0 + ( 1*1 + 1) 2 }" (i + >T" i/o(«)i * = o(i*r M ) 

J — <X) 

as |x| — > co for all M. This shows that /„(x) is, indeed, a good function. 
Next, let us consider 

f Mx)K(x)dx- r fo(x)K(x) dx 

J -co J -oo 
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where K(x) is an arbitrary good function. Substituting (11.15) into this 
expression, we have 

^(y) dyj f 0 ( x )K(x)dx 

< r , slL /o(,)exp (-5){ /c ('-9- x( ' ) } d ' 

~l-/ o(,)K(0 { 1 - exp (^)} d, i 

l/o(0l{- max |K'(x)|ldr+ f \f 0 (i)K{t)\— 2 dt 
J -<X> ( n |x-f|<l J J -<» n 



If 00 A If 00 

»J-J /o(<)l (iTFf d ' + ^J_J /o 



(,)I 0T 7f dt 



as n-+cc, where A and B are constants. The above calculation shows that 

/»co »co 

lim f n {x)K{x)dx = f 0 (x)K(x)dx 

B-^CO J - QO J -oo 

for any good function K(x). Consequently, the sequence /„(*) defines a 
generalized function /(x) and /(x) can be considered equal to / 0 (x) from 
the definition of equality between two generalized functions. Furthermore, 
the derivative and Fourier transform of f(x) can be considered equal to 
those of/ 0 (x) under certain conditions. Let us assume that (1 + |x| 2 ) _w /o'(*) 
is also absolutely integrable from - oo to + oo for some N, then we have 

f f 0 '(x)K(x)dx = - f f 0 (x)K'(x)dx 

J -CO J -oo 

A comparison of this relation with (II. 5) shows that 

in the sense of equality between two generalized functions. To show that 
the Fourier transform F(y) of/(x) is equal to the Fourier transform F 0 (y ) 
of/o(*) when FqCj) exists, we have only to prove that 

f F( y )k(y)dy= f F 0 (y)k(y)dy 

J -CO J -00 
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holds for any good function k(y). This is obvious from the following 
calculation: 

f F(y)k(y)dy= lim f F n {y)k (y) dy 

J -oo n-»co J -oo 

/»CO /• CO 

= lim /„(*)K(x)dx = f 0 (x)X(x) dx 

B-* 00 J — OO J — CO 

/•OO /»O0 «0O 

F 0 (y)e i2 ”“’dyK(x)dx= \ F 0 (y)k(y)dy 

where K(x) is the Fourier transform of /r(>0 and use is made of (11.12) for 
the second equality. 

The above discussion shows for most calculations of interest that it is 
not necessary to distinguish between an ordinary function and the corre- 
sponding generalized function defined by (II. 1 5). We shall henceforth omit 
the subscript 0 for distinguishing ordinary functions from generalized ones. 

Let/(x) and g(x) be generalized functions defined by sequences of good 
functions /„(x) and g n (x), respectively, such that 

/•CO 

K(x)= f(t)g„(x - t)dt 

J — CO 



exists for all n, and that the sequence h n (x ) defines a generalized function 
/i(x). This h(x) is called the convolution of /(x) and g(x) and indicated by 




f(t) g (x - t ) dt 



(11.17) 



The integrand is the product of two generalized functions. It is worth 
noting that if we arbitrarily select two generalized functions, their product 
may not necessarily be defined. This is because there is no guarantee that 
the limit 



/•oo 

lim f„{x)g„{x)K{x) 



dx 



exists for every good function K(x) even though f„(x ) g n {x) is a good 
function. 

The Fourier transform H(y) of h(x) is given by the product of the Fourier 
transforms of /(x) and g(x). This can be seen as follows: 

/•CO /» oo 

H (>-) = lim f(t) g n ( x - t) dt e ~ j2nxy dx 

n-*o 5 J —oo J — oo 
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/•CO ACO 

= lim mg.(()e- J2 ’ u * a ’ dtdi 

n-* oo J —oo J — co 

= I'm f” P J,®e- w « = fW(:W (11.18) 

n - * oo J —oo */ —co 

If/(x) = <5(a) and the sequence g„(x ) defines a generalized function g(x), 
then the sequence 

h n(x)= f S(t)g n (x ~t)dt = g„(x) 

J — 00 



defines the same generalized function g(x), and we have 



9(x)= f S(t)g(x-t) 

J — co 



(II.19) 



Since the Fourier transform of 5 (a) is 1, Eq. (11.18) gives a trivial result 
G(y ) = (?(>>) in this case. 

If we set g( a) = 5(a) in (11.19) and use 5(a) = 5(- a), we have 



5(t)= r 5(0 5(t + T) 

J —co 



( 11 . 20 ) 



Similarly, if we set g(x) equal to I from - 6 to 9 and zero otherwise, we 
have 



*(0)= i = /*/(') dt 



( 11 . 21 ) 



where 6 is a nonzero positive quantity. 

\f f(x) is a generalized function for each value of the parameter t and 
/(a) is another generalized function such that for any good function K( x) 



then we say 



lim f /, (a) K (a) dx = f /(a) K (a) dx (11.22) 

t-*c J — 00 J — CO 



lim/,(x) = /(x) 



(11.23) 



Here c may be finite or infinite. Let F,(y) and F(y) be the Fourier transforms 
of ft( x ) and /(*)> then 

f* oo ACO 

F(y)k(y)d i — I f(x)K(x)dx 

J — CO J —CO 

= lim f f t (a) K. (a) dx = lim f F t (y)k(y)dy 
t~*C J - oo r-»c J - 00 
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or equivalently 



lim F t (y) = F(y) 



(11.24) 



In other words, the Fourier transform of the limit of generalized functions 
is equal to the limit of the Fourier transforms of the generalized functions. 

Let x(r) be a function of time such that a generalized function x T (t) can 
be defined by 

x T (/) = x(t), — T < t < T 

= 0, |T| ( ' 5) 

then the autocorrelation function of x(f) is given by 



R x (r) = lim 



2 T _ 



x T (t) a T (t + t) dt 



(11.26) 



Note that /? x (t) is an even function of t since a change in the sign of t does 
not change the value of the integral on the right-hand side. The Fourier 
transform of x r (x) is given by 



X T (f)=r x T (t)e-Mdl 

J — CO 

from which we have 

A OO A CO 

X T (f)*T*(f)= a T (P)e- j2 * f6 dQ a T (t)e i2xft dt 

J — OO J - 03 



(11.27) 



Multiplying both sides by e }2nfx and integrating from — oo to co with 
respect to f we obtain 

A CO ACO AGO A OO 

|AV(/)| 2 e 12 *'’ d/= l x T (t) x T (8) dt d0 df 

J ~~ 0 0 J — CD J — CO J — 00 

= f j* x T (t) x T (6) S(6 — t — x) d9 dt 

J — CO J — CO 

ACO 

= x T (t) x T (t + t) dt 

J — CO 

Dividing by 2T and taking the limit of 7’->oo, a comparison with (11.26) 
shows that 



K,M= f \x(f)\ 2 e n ’ ! ' df 

J - co 



(11.28) 



where 



|a(/)| 2 = lim ~-\X T (f)\ : 

T- oo 1 1 



(11.29) 
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The function [x(/)| 2 is called the power spectral density of x(f). The inverse 
transform of (11.28) gives 

i*t/)i 2 =r r (H.30) 

J — oo 

Since /? x (t) is an even function of t, (11.30) can be rewritten as 

/•co 

|x (/) | 2 = 2 J R x (t) cos 2nfx dx 

Similarly, since |x(/)| 2 is an even function of / from the above equation, 
(11.28) can be written in the form 

/•co 

H i (t) = 2 I \x(f)\ 2 cos 2nfx df 

If x(f) represents a stationary random process which is ergodic (time 
averages and ensemble averages are equal), then R x (x) can also be written 
in the form 

/•CO /» CO 

*,(t)= x l x 2 p(x l )p( x 2 \x i ;T)dx l dx 2 (11.31) 




where x t and x 2 indicate the values of x(r) at times t apart, p(x j) is the 
probability density function that x(r) is in the vicinity of x u and p(x 2 \x t ; t) 
is the conditional probability density function that x(t) is in the vicinity of 
x 2 at a time x after it was in the vicinity of x t . 

In much the same way, the cross-correlation function between two 
functions x(t) and ,y(/) is defined by 

i r 

R xy (x) = hm — x T (t) y T (t + t) dt (11.32) 

r-oo 2.1 J _ T 

and its Fourier transform is called the cross-power spectral density. When 
x(f) and y(t ) represent stationary random processes which are jointly 
ergodic, /^(t) can be written in the form 

**,(*)= f f x i y l p(x l )p(y i \x l ;x)dx 1 dy i (11.33) 

J — OO J — CO 

where p(y t | x 2 \x) is the conditional probability density function that y(/) 
is in the vicinity y l at a time r after x(i) was in the vicinity of jcj. 

One of the most important concepts introduced in this appendix is that 
of equality between generalized functions. Whenever we have a function 




■i' 

j 
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representing an impulse such as a hammer blow or a surge of electric current, 
we are not interested in the detailed shape of the function but rather the con- 
sequence of such a blow applied to some physical system, i.e., the con- 
sequence which can generally be expressed as the integral of the impulse 
function with a proper weighting function. As a result, if two impulse 
functions give identical results for all possible weighting functions, they 
can be considered equal. This is exactly the concept used to define the 
equality between generalized functions. 

Our mathematical equations are always idealizations for describing 
physical processes. The equality is examined by measurements which will 
never be able to test the values of functions at every point of space or at 
every instant of time. Instead, the examination will be made between the 
weighted averages over a small area in the vicinity of the point or over a small 
time or frequency interval. Thus, the equality we defined for generalized 
functions is more natural and more appropriate in the discussion of physical 
processes than the usual equality which demands the same values at each 
point of space or time. Looking again at Section 9.3, we can see how natural 
the noise discussion can be made with the help of generalized functions. 
The application of generalized functions is not restricted to the discussion 
of noise; there are numerous fields in which generalized functions prove 
to be useful although we did not discuss them in this book. 




REFERENCES 



The reader may wish to supplement his reading by referring to original papers which 
introduce some of the ideas discussed in this book; these will be given here. 

The Smith chart was originally introduced by 
Smith, P. H., Transmission Line Calculator. Electronics 12, 29-31 (1939). 

The equivalent circuit of two-port networks discussed in Section 1.3 is due to 

Weissfloch, A., Kreisgeometrische Vierpoltheorie und ihre Bedeutung fur Messtechnik 
und Schaltungstheorie des Dezimeter- und Zentimeterwellengebietes. Hochfrequenz. u. 
Elek. 61, 100-123 (1943). 

Power waves were first used by 

Penfield, Jr., P., Noise in Negative Resistance Amplifiers. IRE Tram. CT-7, 166-170 
(1960). 

A detailed discussion has been presented by 

Kurokawa, K., Power Waves and The Scattering Matrix. IEEE Trans. MTT-13, 194-202 
(1965). 

For proof of Helmholtz’s theorem, the author consulted 

Morse, P. M., and Feshbach, H., “Methods of Theoretical Physics.” McGraw-Hill, 
New York, 1953. 

For supplementing Chapter 2, the following two books are highly recommended: 
Stratton, J. A., “Electromagnetic Theory.” McGraw-Hill, New York, 1941. 

Panofsky, W. K. H., and Phillips, M., “Classical Electricity and Magnetism.” Addison- 
Wesley, Reading, Massachusetts, 1955. 

A theory of waveguides was first discussed by 

Strutt (Lord Rayleigh), J. W., On the Passage of Electric Waves through Tubes or the 
Vibrations of Dielectric Cylinders. Phil. Mag. 43, 125-132 (1897). 

Most of the fundamental concepts of waveguides, such as modes and cutoff frequencies, 
were established in this paper (Strutt); however, our discussion is largely based on 

Slater, J. C., “Microwave Electronics.” Van Nostrand, Princeton, New Jersey, 1950. 

This is the best book that the present author has read in the field of microwave circuit*, 
and his understanding of microwave circuits was greatly influenced by it. 

A good reference book for eigenvalue problems is ' j 

Courant, R., and Hilbert, D., “Methods of Mathematical Physics,” Vol. I. Wik#| 
(Interscience), New York, 1953. ■$* 



426 






References 



All 



The discussion of Bessel functions has been restricted to a minimum in the preceding 
chapters; those who wish to learn more about these functions may begin with 

McLachlan, N. W., “Bessel Functions for Engineers.” Oxford Univ. Press (Clarendon), 
London and New York, 1934. 

The first paper to discuss the effect of wall losses in waveguides using a perturbation 
method is 

Papadopoulus, V. M., Propagation of Electromagnetic Waves in Cylindrical Waveguides 
with Imperfectly Conducting Walls. Quart. J. Mech. Appl. Math. 7, 325-334 (1954). 

The set of functions which Slater used in his book for the series expansion of the 
magnetic field in cavities was found to be incomplete by 

Teichmann, T., and Wigner, E. P., Electromagnetic Field Expansions in Loss-free Cavities 
Excited through Holes. J. Appl. Phys. 24, 262-267 (1953) 

The discussion of resonant cavities in Section 4.2 is based on 

Kurokawa, K., The Expansions of Electromagnetic Fields in Cavities. IRE Trans. 
MTT-6, 178-187 (1958). 

A detailed discussion of waveguide junctions is presented by 

Montgomery, C. G., Dicke, R. H., and Purcell, E. M., “Principles of Microwave 
Circuits.” McGraw-Hill, New York, 1948. 

This is probably the first book devoted to the theory of microwave circuits. 

Nonreciprocal circuits using ferrites were introduced in the microwave field by 

Hogan, C. L., The Ferromagnetic Faraday Effect at Microwave Frequencies and Its 
Applications— The Microwave Gyrator. Bell System Tech. J. 31, 1-31 (1952). 

The theory of T-junction circulators in Section 5.8 is originally based on 

Auld, B. A., The Synthesis of Symmetrical Waveguide Circulators. IRE Trans. MTT-7, 
238-246 (1959). 

The discussion of distributed couplings in Section 6.1 is based on 
Haus, H. A., Coupling of Modes of Propagation. MIT, Cambridge, Massachusetts. 

Internal memorandum, copies of which are no longer available. 

A concise discussion of periodic structures can be found in 
Watkins, D. A., “Topic in Electromagnetic Theory.” Wiley, New York, 1958. 

The canonical form of noisy n-port networks was published by 

Haus, H. A., and Adler, R. B., Canonical Form of Linear Noisy Networks. IRE Trans. 
CT-5, 161-167(1958). 



428 



References 



% • 
T '^ 



The same theory is also presented by 

Haus, H. A., and Adler, R. B., “Circuit Theory of Linear Noisy Networks.” Wiley, 
New York, 1959. 

These authors used the exchangeable noise measure for their discussion of amplifier 
noise performance. The noise measure, which includes the noise contribution originating 
in and reflected back to the load, was first introduced by 

Kurokawa, K., Actual Noise Measure of Linear Amplifiers. Proc. IRE 49, 1391-1397 
(1961). 

The operating noise temperature, which includes the load noise contribution, is defined 
in IRE Standards on Electron Tubes issued in Proc. IEEE (March 1963). 

Statistical noise theory has not been discussed in this book; those who wish to acquire 
background information are referred to 

Davenport, Jr., W. B., and Root, W. L., “Random Signals and Noise.” McGraw-Hill, 
New York, 1958. 

The unconditional stability was first discussed by 

Llewellyn, F. B., Some Fundamental Properties of Transmission Systems. Proc. IRE 
40, 271-283 (1952). 



The concept of unilateral gain was introduced by 
Mason, S. J., Power Gain in Feedback Amplifier. IRE Trans. CT-1, 20-25 (1954). 



The operating principle of tunnel diodes is based on 

Esaki, L., New Phenomenon in Narrow Germanium p — n Junctions. Phys. Rev. 109, 
603-604 (1958). 

In order to write the qualitative discussion of semiconductors in Section 7.5, the present 
author consulted 

Shive, J. N., “Semiconductor Devices.” Van Nostrand, Princeton, New Jersey, 1959. 

The Manley-Rowe relation was published by 

Manley, J. M., and Rowe, H. E., Some General Properties of Nonlinear Elements— I. 
General Energy Relations. Proc. IRE 44, 904-913 (1956). 

The derivation given in Section 7.6 is, however, based on 
Manley, J. M., and Rowe, H. E., General Energy Relations in Nonlinear Reactances. 
Proc. IRE 47, 21 15-2116 (1959). 

The dynamic Q of variable capacitances was first used by 

Kurokawa, K., and Uenohara, M., Minimum Noise Figure of the Variable Capacitance 
Amplifier. Bell System Tech. J. 40, 695-722 (1961). 







References 



429 



A simple method for measuring the dynamic Q and the adjustment of parametric 
amplifier circuits was presented by 

Kurokawa, K., Use of Passive Circuit Measurements for Adjustment of Variable 
Capacitance Amplifiers. Bell System Tech. J. 41, 361-381 (1962). 

For the discussion of electron beams, the author consulted Chapter 3 in 

Smullin, L. D., and Haus, H. A., “Noise in Electron Devices.” Wiley, New York, 1959. 
Our n p and n s are closely related to n and S, originally introduced by H. A. Haus: 

IT = n P /4izB, S = n 8 /4nB 

The comparison between magnitudes of kinetic and electromagnetic powers in based on 

Louisell, W. H., and Pierce, J. R., Power Flow in Electron Beam Devices. Proc. IRE 
43, 425-427 (1955). 

The distributed couplings of three waves, one circuit, and two space-charge waves was 
discussed in 

Pierce, J. R., Traveling-Wave Tubes. Bell System Tech. J. 29, 608-669 (1950). 






Parameters 4(2 C, C, b used in the above paper are given in terms of our parameters by 

(2j?«|Ci3| 2 ) 2 ' 3 fie 1 1 2(/*8iCi 3 | 2 ) 1 / 3 

The derivation of linear differential equations for nonlinear systems, such as oscillators, 
is known as equivalent linearization. The first comprehensive discussion, which is a 
translation of Russian monographs published in 1934 and 1937, was given by 

Krylov, N., and Bogoliubov, N., “Introduction to Nonlinear Mechanics.” Princeton 
Univ. Press, Princeton, New Jersey, 1943. 

The discussion of generalized functions presented in Appendix II is based on 

Lighthill, M. J., “Introduction to Fourier Analysis and Generalized Functions.” 
Cambridge Univ. Press, London and New York, 1959. 

Two good papers on noise in oscillators are 

Edson, W. A., Noise in Oscillators. Proc. IRE 48, 1454-1466 (1960). 

Mullen, J. A., Background Noise in Nonlinear Oscillators. Proc. IRE 48, 1467-1473 
(1960). 

The discussion in Section 9.3 is based on 

Kurokawa, K., Noise in Synchronized Oscillators. Trans. /EFFMTT-16, 234-240 (1968). 

The following are general texts covering many branches of the subject : 

Southworth, G. C., “Principles and Applications of Waveguide Transmission.” Van 
Nostrand, Princeton, New Jersey, 1950. 

Ginzton, E. L., “Microwave Measurements.” McGraw-Hill, New York, 1957. 

Collin, R. E., “Field Theory of Guided Waves.” McGraw-Hill, New York, 1960. 



References 1 

1962 0, S ' and WHINNERY ’ J ‘ R ’’ “ Fields and Waves in Modern Radio.” Wiley, New York ; 

Johnson, C. C., “Field and Wave Electromagnetics.” McGraw-Hill, New York, 1965. 

J966 LIN ” R E ' F ° Undations for Microwav e Engineering.” McGraw-Hill, New York I 

INDEX 




A 

Actual power, 36 
Adjoint matrix, 204 
AM noise, 394 
Ampere-turn, 67 

Attenuation constant, 23, 162, 198 
Autocorrelation function, 423 
Available gain, 289 
Available power, 34 



B 

Beam -coupling factor, 361 
Bessel function, 141 
hyperbolic, 379 
Bilinear transformation, 23 
Boltzmann constant, 300 
Brillouin zone, 286 



C 



Canonical form, 296 
Canonical transformation, 296 
Characteristic admittance, 7 
Characteristic impedance, 7, 93 
Circulator, 248 

Completeness, 85, 107, 110, 120 
Conduction band, 322 
Conductivity, 69 



Conservation 
of charge, 67 
of energy, 70 
Convolusion, 421 
Coordinate transformation, 205 
Coulomb, 67 
Coupled mode, 256 
Coupling matrix, 258 
Cross-correlation function, 424 
Cross-power spectral density, 424 
Cutoff, 89 

Cylindrical coordinate, 61 

D 

Degeneracy, 140 
removal of 148, 160, 167 
Delta-function, 415 
Depletion layer capacitance, 325 
Diagonal matrix, 211 
Diffusion capacitance, 326 
Directional coupler, 168, 169 
Displacement current, 66 
Divergence, 48 
Dominant mode, 90 
Dynamic Q, 340 



E 

Eigenfunction, 99 



SUBJECT INDEX 



432 



Eigenfunction approach, 84 
Eigenvalue, 99, 206 
infinite growth of, 107, 399 
problem, 99, 206 
Eigenvector, 206 
Electron beam, 346 
Ensemble average, 294 
Equivalent circuit 
of electrode gap, 364 
of linear one-port generator, 33 
of lossless reciprocal three-port 
network, 233 

of lossless reciprocal two-port 
network, 33 

of one-port resonant cavity, 187 
of reciprocal two-port network, 31 
of tunnel diode, 330 
of two-port resonant cavity, 191 
of unconditionally stable amplifier, 313 
of waveguide window, 97, 149 
Exchangeable gain, 344 
Exchangeable noise figure, 344 
Exchangeable noise measure, 344 
Exchangeable power, 34 
External Q, 186 

F 

Faraday rotation, 254 
Fast wave, 353 

Fermi-Dirac distribution law, 321 
Fermi level, 321 
Ferrite, 240 
FM noise, 394 

Foster’s reactance theorem, 228 
Fourier expansion, 110 
Fourier transformation, 416 
Functional, 104 

G 

Gauss’s theorem, 49, 51 
Generalized function, 415 
Good function, 415 
Gradient, 54 

Group velocity, 94, 138, 358 



H 

Helmholtz’s theorem, 61 
Hole, 322 

I 

Idealized model, 1, 4 
Idler, 337 
Incident wave, 12 
Injection locking, 386-389 
Integration by parts, 57 
Inverse matrix, 202 
Isolator, 254 

J 

Junction capacitance, 326 

K 

Kinetic energy, 351 
Kinetic power, 350 
Kinetic voltage, 350 
Klystron, 364-367 

L 

Linearity, 5 
Loaded Q, 190 

Lossless condition, 224, 225, 277, 372 
Low-field loss, 247 

M 

Magic T, 253 
Magnetic induction, 63 
Magnetization, 242 
Manley-Rowe relations, 334 
Matched load, 98 
Matrix, 200-217 

Maximum-minimum property, 41 1 
Maxwell’s equations, 67, 69 
Movable short, 98 

N 

/i-Dimensional vector, 204 
Natural precession frequency, 246 
Net power, 1 1 




Subject Index 

Neumann function, 141 
1// Noise, 396 
Noise figure, 302 
Noise temperature, 301 
Nonlinear reactance, 333 
Nonlinearity, 333 
Nonsingular matrix, 203 
Nonsingular transformation, 297 
Normalization, 107, 118, 125 
Normalized admittance, 21 
Normalized impedance, 14, 93 

O 

<a-fi Diagram, 286 
Operating noise measure, 301 
Operating noise temperature, 300 
Optimum noise measure, 303 
of electron beam device, 378 
of parametric amplifier, 342 
of tunnel diode amplifier, 331 
Orthogonality, 78, 101, 116, 125 
Orthonormality, 125, 126, 128 
Oscillator, 380 
Overcoupling, 190 

P 

Parametric amplifier, 337 
Passband, 281 
Perfect conductor, 85 
Periodic structure, 274-282 
Permeability, 69 
Perturbation method, 164, 264 
Phase constant, 6 
Phase velocity, 11, 95, 138 
Photon, 319 

Piecewise-continuous function, 107, 119, 
174 

Plane wave, 76 
Plasma frequency, 353 
reduced, 354 
Plasma wavelength, 364 
Piosson’s equation, 58 
Polarization, 79-80 
Positive-definiteness, 212 
Power-loss method, 164 



433 

Power spectral density, 424 
Power transmission coefficient, 37 
Power wave, 35 
Poynting vector, 70 
complex, 72 

Principle of superposition, 5 
Projection, 42 
Propagation constant, 6 
Pump, 336 

Q 

(J-Curve, 173 

R 

Reciprocal condition, 220, 222, 276 
Reference impedance, 222 
Reflected wave, 12 
Reflection coefficient, 12 
power, 37 
power wave, 36 
Rellich’s theorem, 401, 407 
Resonant cavity, 170 
Rieke diagram, 385 
Right-handed screw rule, 43 
Rotary vane type attenuator, 147 
Rotation, 51 

S 

Saturation magnetization, 246 
Scalar products, 42 
Scattering matrix, 222 
Self-adjoint matrix, 204 
Semiconductor, 322-326 
intrinsic, 322 
n-type, 323 
/j-type, 323 

Semipositive-definiteness, 215 
Separation of variable, 140 
Similarity transformation, 205 
Simultaneous diagonalization, 215, 216, 
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Simultaneous matching, 311-312, 345 
Singular matrix, 203 
Shot noise, 330, 397 
Skin depth, 76 
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Slow wave, 353 

Slow-wave structure, 288 

Smith chart, 1 5 

Space charge wave, 355 

Space harmonics, 287 

Spectral representation, 210, 264, 279, 280 

Square-integrable function, 107 

Standard noise temperature, 302 

Standing wave detector, 98 

Standing wave method, 21 

Standing wave ratio, 20 

Stokes’s theorem, 52 

Stopband, 281 

Surface current, 87 

Symmetrical Y junction, 234-240 

T 

Tapered section, 99 
Tensor permeability, 241 
Total matching, 239 
Trace, 207 

Transducer gain, 289, 300 
Transfer matrix, 276 
Transposed matrix, 203 
Transverse electric mode, 90 
Transverse electromagnetic mode, 123 
Transverse magnetic mode, 123 
Traveling wave, 1 1 
Traveling wave tube, 367 



Tunnel diode, 318-329 
Tunnel effect, 328 

U 

Unconditional stability, 308, 311 
Undercoupling, 190 
Unilateral gain, 315 
Unilateralization, 318 
Unitary matrix, 204 
Unloaded Q, 186 
Up-converter, 336 

V 

Valence band, 322 

Variational expression, 105, 116, 127-129, 
145, 152, 167, 169, 173, 175, 196, 197, 
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Vector, 40-63 
Vector product, 43 

W 

Waveguide, 84 
impedance, 92 
window, 97, 149 
Wavelength, 10, 90, 138 
Weber, 67 

Well-behaved function, 1 30 
White noise, 389 




